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I. Basis of the opinion 

1 . With regard to the elements of the international application (Replacement sheets which have been furnished to 
the receiving Office in response to an invitation under Article 14 are referred to in this opinion as "originally 
filed"): 



Description, Pages 

1 -59 as originally filed 

Claims, Numbers 

1 -28 - as originally filed 

Drawings, Sheets 

1/12-12/12 as originally filed 

Sequence listing part of the description, pages: 

1 -7, as originally filed 

2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language in which the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language: , which is: 

□ the language of a translation furnished for the purposes of the international search (under Rule 23.1 (b». 

□ the language of publication of the international application (under Rule 48.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under 
Rule 55.2 and/br 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was carried out on the basis of the sequence listing: 

S contained in the international application in written form. 

□ filed together with the international application in computer readable form. 

□ furnished subsequently to this Authority in written form. 

□ furnished subsequently to this Authority in computer readable form. 

□ The statement that the subsequently furnished written sequence listing does not go beyond the disclosure 
in the international application as filed has been furnished. 

□ The statement that the information recorded in computer readable form is identical to the written sequence 
listing has been furnished. 

4. The amendments have resulted in the cancellation of: 

□ the description, pages: 

□ the claims, Nos.: 

□ the drawings, sheets: 

5. □- This opinion has been established as if (some of) the amendments had not been made, since they have 

been considered to go beyond the disclosure as filed (Rule 70.2(c)). 
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6. Additional observations, if necessary: 

V. Reasoned statement under Rule 66.2(a)(ii) with regard to novelty, inventive step or industrial 
applicability; citations and explanations supporting such statement 

1. Statement 

Novelty (N) Claims 1-28 

Inventive step (IS) Claims 1 -28 

Industrial applicability (IA) Claims 

2. Citations and explanations 
see separate sheet 
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SEPARATE SHEET 



D1: DATABASE EBI [Online] EMBL; 1 August 1998 (1998-08-01) LIN X.: 

'Putative Beta-Glucosidase.' Database accession no. 064883 XP002203312 

D2: DATABASE EBI [Online] EMBL; 14 October 1997 (1997-10-14) MALBOOBI 
M.A. ET AL.: 'Arabidopsis thaliana beta-glucosidase mRNA, complete cds.' 
Database accession no. U72153 XP002203313 

D3: STOTZ HENRIK U ET AL: 'Induced plant defense responses against 
chewing insects. Ethylene signaling reduces resistance of Arabidopsis 
against Egyptian cotton worm but not diamondback moth.' PLANT 
PHYSIOLOGY (ROCKVILLE), vol. 1 24, no. 3, November 2000 (2000-1 1 ), 
pages 1007-1017, XP001 085235 ISSN: 0032-0889 

D4: MALBOOBI M A ET AL: 'A PHOSPHATE-STARVATION INDUCIBLE BETA- 
GLUCOSIDASE GENE (PSR3.2) ISOLATED FROM ARABIDOPSIS 
THALIANA IS A MEMBER OF A DISTINCT SUBFAMILY OF THE BGA 
FAMILY' PLANT MOLECULAR BIOLOGY, NIJHOFF PUBLISHERS, 
DORDRECHT, NL, vol. 34, no. 1, 1 May 1997 (1997-05-01), pages 57-68, 
XP002061 130 ISSN: 0167-4412 

ad V. 

1. Novelty (Article 33(2) PCT) 

1 .1 . The present application is concerned with polynucleotides encoding beta- 
glucosidases; also encompassed are constructs harbouring such molecules, and 
methods to use them. 

1 .2. D1 discloses a putative beta-glucosidase from Arabidopsis thaliana, which 
sequence is 100% identical to the claimed sequences. 

D2 also refers to a beta-glucosidase from Arabidopsis thaliana being more than 
60% identical to the sequences of the invention. 

D3 and D4 both comprise documents that describe plant defense mechanisms 
due to novel beta glucosidase genes in Arabidopsis thaliana. 

1 .3. It follows that in view of D1 novelty can not be acknowledged for the subject- 
matter of claims 1 -4, and 11.. 

Novelty can also not be acknowledged in view of D2-D4 due to the vague wording 
of respective claim 1, which refers also to "fragments.. .having beta-glucosidase 
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activity" and "hybridizing polynucleotides" which can not be distinguished from the 
molecules described in D2-D4. 

Thus, novelty can not be acknowledged for claims 1-4, and 1 1 in view of D2, and 
for claims 1-28 in view of D3-D4. 

1 .4. Please note that claim 24 refers to "compounds" that have been identified or 
obtained by the methods of claims 1 8-23. 

These not further defined compounds can not be distinguished from any already 
known compounds in the art, and thus do not fulfill the requirements of Article 
33(2) PCT. 

Furthermore, such "reach-through" claims are not accepted under the regulations 
of the European Patent Office. 

2. Inventive Step (Article 33(3) PCT) 

2.1 . Applicants should also note that the general principle of beta-glucosidases in plant 
defense responses have been known in the art. 

Thus, it appears that the problem to be solved is the provision of further 
sequences encoding beta-glucosidases being involved in plant defense 
mechanisms. 

The provision of further sequences can however not be regarded as involving an 
inventive step. 

2.2. Thus, the subject-matter of claims 1 -28 does not seem to satisfy the requirements 
of Article 33(3) PCT. 
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CLAIMS 

1 . A polynucleotide selected from the group consisting of 

(a) polynucleotides comprising a nucleotide sequence encoding a 
polypeptide with the amino acid sequence of SEQ ID NO:2; 

(b) polynucleotides comprising the nucleotide sequence shown in SEQ ID 
NO:1; 

(c) polynucleotides comprising a nucleotide sequence encoding a fragment 
of the polypeptide encoded by a polynucleotide of (a) or (b), wherein 
said nucleotide sequence encodes a protein having (5-glucosidase 
activity; 

(d) polynucleotides comprising a nucleotide sequence the complementary 
strand of which hybridizes to the polynucleotide of any one of (a) to (c), 
wherein said nucleotide sequence encodes a protein having p- 
glucosidase activity; and 

(e) polynucleotides comprising a nucleotide sequence that deviates from 
the nucleotide sequence defined in (d) by the degeneracy of the genetic 
code; 

2. The polynucleotide of claim 1 which is DNA or RNA. 

3. A recombinant nucleic acid molecule comprising the polynucleotide of claim 1 
or 2. 

4. The recombinant nucleic acid molecule of claim 3 further comprising 
expression control sequences operably linked to said polynucleotide. 

5. A vector comprising a polynucleotide of claim 1 or 2 or the recombinant nucleic 
acid molecule of claim 3 or 4. 



6. 



The vector of claim 5 further comprising expression control sequences 
operably linked to said polynucleotide. 
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7. A method for producing genetically engineered host cells comprising 
introducing the polynucleotide of claim 1 or 2, the recombinant nucleic acid 
molecule of claim 3 or 4 or the vector of claim 5 or 6 into a host cell. 

8. A host cell which is genetically engineered with the polynucleotide of claim 1 or 
2, the recombinant nucleic acid molecule of claim 3 or 4 or the vector of claim 
5 or 6 or obtainable by the method of claim 7. 

9. The host cell of claim 8 which is a bacterial, yeast, fungus, plant or animal cell. 

10. A method for the production of a polypeptide encoded by a polynucleotide of 
claim 1 or 2 in which the host cell of claim 8 or 9 is cultivated under conditions 
allowing for the expression of the polypeptide and in which the polypeptide is 
isolated from the cells and/or the culture medium. 

11. A polypeptide encoded by the polynucleotide of claim 1 or 2 or obtainable by 
the method of claim 10. 

12. An antibody specifically recognizing the polypeptide of claim 1 1 . 

13. A method for producing a transgenic plant comprising the steps of 

(a) introducing the polynucleotide of claim 1 or 2, the recombinant nucleic 
acid molecule of claim 3 or 4 or the vector of claim 5 or 6 into the 
genome of a plant cell; and 

(b) regenerating the cell of (a) to a transgenic plant. 

14. A transgenic plant or plant tissue comprising the plant cells of claim 9 or 
obtainable by the method of claim 1 3. 

15. A transgenic plant which shows an increased activity of the polypeptide 
encoded by the polynucleotide of claim 1 or 2 compared to a corresponding 
wild-type plant. 
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16. The transgenic plant of claim 14 or 15 which, upon an increased activity of the 
protein encoded by the polynucleotide of claim 1 or 2, shows an increased 
resistance against a plant pathogen to which a corresponding wild-type plant is 
susceptible. 

17. Propagation material or harvestable parts of the transgenic plant of any one of 
claims 14 to 16 comprising plant cells of claim 9. 

18. A method for conferring pathogen resistance or increased pathogen resistance 
to a plant comprising the step of providing a transgenic plant in which the 
activity of the polypeptide encoded by the polynucleotide of claim 1 or 2 is 
increased. 

19. A method for identifying a compound that is hydrolyzed by a polypeptide 
encoded by the polynucleotide of claim 1 or 2 comprising the steps of 

(a) contacting a candidate compound with said polypeptide under 
conditions where said polypeptide is active; and 

(b) determining whether said candidate compound is hydrolyzed by said 
polypeptide. 

20. A method for identifying a compound that is hydrolyzed by a polypeptide 
encoded by the polynucleotide of claim 1 or 2 comprising the steps of 

(a) providing a 3-dimensional structure model of said polypeptide; and 

(b) determining the structure of a substrate that fits into the 3-dimensional 



21. A method for identifying a compound that is hydrolyzed by a polypeptide 
encoded by the polynucleotide of claim 1 or 2 comprising the steps of 

(a) providing a mutant protein of said polypeptide the catalytic activity of 
which is abolished without loosing substrate binding activity; 

(b) contacting a candidate compound with said mutant protein; and 

(c) determining whether the candidate compound is bound by said mutant 



structure model of (a). 



protein. 
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22. The method of any one of claims 19 to 21 furthermore comprising the step of 
determining whether the identified compound or a hydrolysis product thereof is 
capable of inducing or enhancing a defence response against a pathogen in a 
plant. 

23. A method for preparing a plant protection composition comprising the steps of 
the method of claims 19, 20, 21 and/or 22 and furthermore the step of 
formulating the identified compound or a hydrolysis product thereof in a form 
suitable for administering to plants. 

24. A compound, a hydrolysis product thereof or a plant protection composition 
identified or obtained by the method of any one of claims 18 to 23. 

25. A kit comprising the polynucleotide of claim 1 or 2, the recombinant nucleic 
acid molecule of claim 3 or 4, the vector of claim 5 or 6, the polypeptide of 
claim 11, the antibody of claim 12 or the compound or a hydrolysis product 
thereof of claim 24. 

26. Use of the polynucleotide of claim 1 or 2, of the recombinant nucleic acid 
molecule of claim 3 or 4, of the vector of claim 5 or 6, of the host cell of claim 8 
or 9, of the polypeptide of claim 11, of the antibody of claim 12 or of the 
transgenic plant of any one of claims 14 to 17 for identifying a compound that 
is hydrolyzed by a polypeptide encoded by said polynucleotide. 

27. Use of the polynucleotide of claim 1 or 2, of the recombinant nucleic acid 
molecule of claim 3 or 4, of the vector of claim 5 or 6, of the host cell of claim 8 
or 9 or of the polypeptide of claim 11, of the antibody of claim 12 or of the 
transgenic plant of any one of claims 14 to 17 for the preparation of a plant 
protection composition. 

28. Use of the polynucleotide of claim 1 or 2, of the recombinant nucleic acid 
molecule of claim 3 or 4, of the vector of claim 5 or 6 f of the host cell of claim 8 
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or 9 or of the polypeptide of claim 11, of the antibody of claim 12 or of the 
transgenic plant of any one of claims 14 to 17 for establishing or enhancing a 
pathogen resistance in plants. 
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Ad V. 

1 . Novelty (Article 33(2) PCT) 

1.1. Claims 1-8, 10-20, and 24-26 are formally regarded new in view of the amended 
set of claims filed with the letter dated April 7th 2004, introducing a heterologous 
promoter operatively linked to said polynucleotides. 

1 .2. Claim 9 however, is not new since the claimed polypeptide (not comprising the 
promoter anymore) can not be distinguished from the deduced polypeptide of D1 being 
1 00% identical over the entire length, i.e. 560 amino acids. 

1 :3. The same applies to the subject-matter of claims 21 , 22, and 23. 
Claim 22 is a so-called reach-through-claim, being directed to 

'down-stream* inventions, i.e. not yet identified inventions. As such it is not possible to 
ascertain whether the claimed compounds have been known, respectively which 
compounds fall within the scope of the claim. The mere fact that the PIN2 polypeptide 
comprises a unique amino acid combination in its catalytic core does not necessitate 
that novel, and especially which novel compounds bind to such enzymes. 
Thus, claim 22 does not meet the requirements of Articles 33(2)PCT, Article 5 PCT, 
and Article 6 PCT. 

The same applies to therefrom dependent claim 23, inasfar it concerns said 
compounds. 

Claim 21 represents a hidden reach-through claim, since it comprises the step of 
formulating said non-allowable compounds (in order to apply them to plants). 

2. Inventive step (Article 33(3) PCT) 

2.1. Claims 1-13, 15, 17, 18, 19, and 24. 

These claims refer to - modifications of - the known beta-glucosidase of D1 . 

The linkage of a heterologous promoter to a known polypeptide, or the use of a vector 
to express these polynucleotides, and also the generation, respectively the use of 
transgenic plants using known polynucleotides is regarded as obvious modifications, 
respectively applications commonly practised in the art. 

Moreover, applicants should note that the discovery of an inherent feature of a known 
polynucleotide (i.e. conferring resistance) does neither render them novel, nor inventive. 

2.2. Claims 14, 16, 20, and 25. 

Inventive step can however be allowed for the subject-matter of claims 14, 16, 20, 25 
and 26. 

D3 is regarded as closest prior art. 

This document describes a beta-glucosidase of Arabidopsis thaliana in plant defence 
mechanisms. The protein of D3 displays 44% overall identity to the claimed enzymes. 

The problem to be solved is therefore regarded as the provision of a further beta- 
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While D1 describes exactly the enzyme of the present application there is no linkage to 
D3, or any mentioning of using the enzyme of D1 in plant defence mechanisms. 

Therefore, claims 14, 16, 20, 25, and 26 are considered to meet the requirements of 
Article 33(3) PCT. 
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This is in response to the Written Opinion dated January 14, 2004: 

Enclosed please find a set of new claims 1 to 26 based on which further international 
preliminary examination shall be carried out. 

1. Amendments to the claims 

1.1 New claim 1 corresponds to original claim 1 except for the following 
amendments: 

(i) The claim has been rephrased so as to refer to recombinant nucleic 
acid molecules comprising the polynucleotides of the invention as 
defined in sections (a) to (e) and, operatively linked thereto, a 
heterologous promoter. As support for this amendment, we herewith 
point to original claims 3 and 4 and the description on page 15, 2 nd 
paragraph and page 15, last paragraph to page 16, 2 nd paragraph. 

(ii) Original section (d) has been replaced by new section (d) referring to 
polynucleotides which are defined by a sequence identity of at least 
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50% of the encoded polypeptide to the amino acid sequence encoded 
by the polynucleotide of section (a) or (b). This amendment is 
supported by the description on page 10, lines 17 to 22. 

1 .2 Original claims 2 and 3 have been deleted. 

1 .3 New claim 2 corresponds to original claim 4 with the exception that its wording 
has been adapted to new claim 1 by inserting the word "additional" before 
"expression control sequences". This modification takes into account that the 
recombinant nucleic acid molecule of claim 1 already contains with the 
promoter an expression control sequence operatively linked to the comprised 
polynucleotide of the invention. 

1.4 New claims 3 to 26 correspond to original claims 5 to 28 except that any 
references in these claims to the polynucleotide of the invention have been 
adapted to the wording of new claim 1 by using the phrase "polynucleotide 
defined in claim 1". 

2. Novelty (Article 33(2) PCT) 

2.1 0 64883 (DP 

2.1.1 In sections 1.2 and 1.3 of the Written Opinion, the Examiner raises a novelty 
objection against claims 1 to 4 based on O 64883 (DP because of an alleged 
100% sequence identity between the amino acid sequence disclosed in D1 
and SEQ ID NO:2. 

2.1.2 With respect to claim 1, we submit that this objection does not apply to new 
claim 1 . The amino acid sequence shown in Dl is a deduction from an open 
reading frame identified in the course of sequencing the Arabidopsis genome 
as being a putative beta-glucosidase. This is evident from the cross-reference 
to the database entry AC004521 depicting part of the genomic sequence of 
Arabidopsis (see also the description of the present application on page 7, 1 st 
paragraph). We note that neither Dl nor AC004521 discloses a recombinant 
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nucleic acid molecule as defined in new claim 1 . Thus, new claim 1 is novel 
over Dl and AC004521 . 

2.1 .3 With respect to objected claims 2 to 4, it is submitted that original claims 2 and 
3 have been deleted and that new claim 2 (corresponding to original claim 4) 
derives novelty over Dl from new claim 1 from which said claim depends. 

2.2 U 721 53 (D2) 

2.2.1 In sections 1.2 and 1.3 of the Written Opinion, claims 1 to 4 and 11 are 
furthermore objected to for lack of novelty over U 72153 (D2) . The Examiner 
points in this regard to an alleged sequence identity of more than 60% 
between the protein disclosed in D2 and SEQ ID NO: 2 and to an allegedly 
vague wording in sections (c) and (d), referring to fragments and hybridizing 
polynucleotides. 

2.2.2 With respect to new claim 1 , however, we submit that this objection lacks any 
basis. Specifically, section (c) of claim 1 refers to fragments of the amino acid 
sequence shown in SEQ ID NO: 2 which have beta-glucosidase activity. Such 
fragments are not contained in the amino acid sequence disclosed in D2. As 
evidence, we herewith enclose as Exhibit 1 alignments between the coding 
sequences and the amino acid sequences each presented in the present 
application (denoted by "Dl") and in D2. As can be seen in the amino acid 
alignment, the longest overlap with 100% sequence identity has a length of 14 
amino acids (see positions 162 to 175 of the D2 sequence). Such a fragment 
is clearly too short to contain all the required elements (e.g. catalytic domain 
and substrate binding sites) in a functional three-dimensional conformation so 
as to show beta-glucosidase activity. Thus, D2 does not disclose a fragment 
as defined in claim 1 (c). 

2.2.3 Section (d) of new claim 1 now refers to polypeptides having a sequence 
identity of at least 50% to the polypeptide encoded by the polynucleotide of 
section (a) or (b), i.e. the full-length amino acid sequence of SEQ ID NO: 2. 
From the amino acid sequence alignment shown in Exhibit 1 , it is derivable 
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that the amino acid sequences of D2 and SEQ ID NO: 2 have an identity of 
43%. Thus, the "at least 50%" feature in claim 1 (d) clearly limits the claimed 
polynucleotides from D2. 

2.2.4 In view of the above, we note that new claim 1 is novel over D2. A further 
reason is that D2 does not disclose recombinant nucleic acid molecules 
comprising a polynucleotide encoding the beta-glucosidase and a 
heterologous promoter. Furthermore, we note that new claim 2 derives novelty 
over D2 from claim 1 from which it depends. 

2.2.5 With regard to new claims 9 (corresponding to original claim 11), it is 
submitted that the features delimiting the polynucleotides defined in claim 1 
from D2 (see sections 2.2.2 and 2.2.3, supra) are likewise present in the 
definition of the claimed polypeptide. Thus, new claim 9 is novel over D2. 

2.3 Stotz (D3) 

2.3.1 Under items 1.2 and 1.3 of the Written Opinion, a further novelty objection is 
raised against claims 1 to 28 based on Stotz (D3) . Also in this context, the 
Examiner points to an allegedly vague wording in sections (c) and (d) of claim 
1 , referring to fragments and hybridizing polynucleotides. 

2.3.2 However, we do not consider this objection applicable to new claim 1. As 
support, we herewith point to enclosed Exhibit 2 containing alignments 
between the coding sequences and the amino acid sequences each 
presented in the present application (denoted as "D1") and in D3. The amino 
acid sequence alignment of Exhibit 2 shows that the longest overlap with 
100% sequence identity has a length of 13 amino acids. Such a fragment is 
clearly too short to constitute an active beta-glucosidase (see also section 
2.2.2, supra). Thus, D3 does not disclose a fragment having beta-glucosidase 
activity as defined in claim 1(c). Furthermore, the two amino acid sequences 
have a sequence identity of 44% which is clearly below the value of 50% 
mentioned in claim 1(d). Moreover, D3 does not disclose a recombinant 
nucleic acid molecule containing a p-glucosidase sequence. For these 
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reasons, new claim 1 is novel over D3. The same applies mutatis mutandis to 
new claims 2 and 9 (corresponding to original claims 4 and 11). 

2.3.3 With respect to new claims 3 to 8 and 10 to 26 (corresponding to the also 
objected original claims 5 to 10 and 12 to 28), we submit that the claimed 
subject-matters are not disclosed in D3. We direct the attention of the 
Examiner to the fact that D3 describes the identification of the beta- 
glucosidase BGL1 from Arabidopsis (see D3, page 1015, left column, "Gene 
Isolation" and Figure 9 depicting its amino acid sequence). Furthermore, D3 
describes Northern blot experiments in which total RNA is hybridised with a 
probe consisting of a partial BGL1 cDNA (see D3, page 1015, left column, 
lines 7 and 8 of the section "Gene Expression Analysis"). However, D3 clearly 
does not describe vectors, host cells, transgenic plants or antibodies or 
methods and used for any one of these products. Thus, new claims 3 to 8 and 
10 to 26 are novel over D3. 

2.4 Malboobi (D4) 

2.4.1 Under items 1.2 and 1.3 of the Written Opinion, a further novelty objection is 
raised against claims 1 to 28 based on Malboobi (D4) . Also in this context, the 
Examiner points to an alleged vague wording in sections (c) and (d) of claim 1 . 

2.4.2 This objection does not apply to new claim 1. In support, we herewith point to 
enclosed Exhibit 3 containing alignments between the coding sequences and 
the amino acid sequences each presented in the present application (denoted 
as "Dl") and in D4. The amino acid sequence alignment of Exhibit 3 shows 
that the longest overlap with 100% sequence identity has a length of 13 amino 
acids. Such a fragment is clearly too short to constitute an active beta- 
glucosidase (see also section 2.2.2, supra). Thus, D4 does not disclose a 
fragment having beta-glucosidase activity as defined in claim 1 (c). 
Furthermore, the two amino acid sequences have a sequence identity of 44% 
which is clearly below the value 50% mentioned in claim 1(d). For these 
reasons, new claim 1 is novel over D4. 
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2.4.3 With respect to new claims 2 to 7 and 9 (corresponding to objected original 
claims 4 to 9 and 1 1), we note that these claims directly or indirectly refer back 
to new claim 1 and, thus, derive novelty over D4 from claim 1 . 

2.4.4 With respect to new claims 8 and 10 to 26 (corresponding to objected original 
claims 10 and 12 to 28), we submit that none of the claimed subject-matters is 
disclosed in D4. We direct the attention of the Examiner to the fact that D4 
describes the isolation of the Arabidopsis gene encoding the beta-glucosidase 
PSR3.2 by using Brassica nigra cDNA encoding PSR3.1 as a probe in a 
hybridisation screening (see D4, the paragraph bridging pages 58 and 59). 
Figure 2 of D4 shows the nucleotide and amino acid sequence of PSR3.2 
which was subcloned in the E. coli-propagated vector pBSK in order to being 
sequenced. Furthermore, D4 describes Northern blot experiments in order to 
elucidate PSR3.2 gene expression in different tissues and depending on 
phosphate starvation (Figures 4 to 6). 

However, D4 definitely does not describe the subject-matter of new claims 8 
and 10 to 26 encompassing a method for the production of a polypeptide, 
antibodies, transgenic plants and related methods and uses applying the 
products of the invention. Thus, new claims 8 and 10 to 26 are novel over D4. 

2.5 Claim 22 

2.5.1 Under item 1.4 of the Written Opinion, the Examiner objects that claim 24 
(corresponding to new claim 22) which is directed to compounds identified as 
a substrate of the protein of the invention in any one of the methods of claims 
19 to 23 or to hydrolysis products thereof or plant protection compositions 
comprising said compound or product lacks novelty over known substances. 

2.5.2 This objection should not be justified. As evidence, we herewith refer the 
Examiner to Table 1 on page 6 of the description. In this table, the catalytically 
active amino acid residues and the amino acid residues determining specificity 
for aglucone and glucose binding of 47 Arabidopsis family 1 glycosyl 
hydrolases (including PEN2, the enzyme of the invention, in row 9) are 
compared. Importantly, PEN2 has a unique combination of residues 
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determining aglucone binding ("agl. sp. n ) which is not found in any of the other 
46 enzymes. Thus, it can be expected that the substrate of PEN2 and, 
consequently, also the hydrolysis products thereof is unique within the 
corresponding compounds for which the other enzymes are specific. This 
provides clear evidence that the compound and hydrolysis product thereof 
referred to in new claim 1, as well as compositions comprising these, are not 
known in the prior art. Thus, new claim 22 is novel. 

3. Inventive step (Article 33(3) PCT) 

3.1 Under items 2 to 2.2 of the Written Opinion, the Examiner objects that the 
subject-matter of claims 1 to 28 does not involve inventive step. The Examiner 
justifies this view by the fact that the general principle that beta-glucosidases 
are involved in plant defense responses has been known in the art. The 
technical problem to be solved is therefore seen in the provision of further 
sequences encoding beta-glucosidases being involved in plant defense 
mechanisms. This provision is contended to lack inventive step. 

3.2 We disagree with this assessment of the Examiner. In particular, we submit 
that the formulation of the technical problem requires an adequate 
appreciation of the contribution of the invention to the art as well as of the prior 
art knowledge as is discussed in the following. 

3.2.1 The present invention contributes to the art PEN2 which is a beta-glucosidase 
that can be used in order to establish or improve resistance, in particular non- 
host resistance, in plants. This enzyme was identified by screening through 
randomly mutagenized Arabidopsis plants by using an approach that was not 
known before the priority date. Specifically, tens of thousands of M2 plants 
were examined for a significantly increased incidence of hypersensitive 
responses upon treatment with a non-host pathogen (see, e.g. Example 2). 
Autofluorescence of epidermal cells was used as an indicator for a 
hypersensitive response taken place. In this way, a plant in which the gene 
encoding PEN2 is knocked-out was identified and the gene was isolated via 
map-based cloning. The knock-out plants were shown to have a considerably 
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reduced resistance against a multitude of non-host pathogens, both in static 
and in inducible non-host resistance mechanisms (see, e.g. Example 3). This 
provides convincing evidence for PEN2's central role in mediating non-host 
resistance mechanisms. In view of this, transgenic plants in which PEN2 is 
overexpressed as well as tools for obtaining such plants such as the 
recombinant nucleic acid molecule of claim 1 are considered to be of great 
value. The same is to be expected for the substrate or hydrolysis product 
thereof which is expected to be of use in triggering endogenous resistance 
mechanisms. 

3.2.2 D3 can be considered the closest prior art. It teaches that a beta-glucosidase 
of Arabidopsis which shows 44% sequence identity with SEQ ID NO: 2 (see 
item 2.3.2, supra) might be involved in resistance mechanisms against 
insects. The other cited prior art documents seem to be still less relevant with 
respect to the technical problem solved by the invention because Dl and D2 
do not at all mention any functions of the respective beta-glucosidase in 
connection with plant defense against stresses and D4 concerns abiotic 
stress. 

3.2.3 In view of the closest prior art, the technical problem may be seen in the 
provision of means and methods for establishing a broad, in particular non- 
host resistance in plants. This formulation duly appreciates the difference 
between the closest prior art and the invention. In particular, it does not 
contain any portion of the solution as does the formulation in the Written 
Opinion by referring to beta-glucosidase. Any connection between non-host 
resistance and beta-glucosidase is not derivable from the cited documents. 

3.3 The claimed solution to the technical problem would not have been obvious in 
view of the prior art. Contrary to the view of the Examiner, a person skilled in 
the art would not have believed that beta-glucosidases are in general involved 
in plant defense mechanisms. As evidence, we refer to the following prior art 
documents showing that these enzymes were known to also have different 
functions: 
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(i) beta-glucosidases hydrolyzing cell wall polysaccharides: 

- Gerard! et al., Purification and characterization of a beta- 
glucosidase abundantly expressed in ripe sweet cherry (Prunus 
avium L.) fruit. Plant Science 160(5) (2001), 795-805 (enclosed as 
D5) 

- Leah et al., Biochemical and Molecular Characterization of a Barley 
Seed p-Glucosidase. Journal of the Institute of Brewing 101(3) 
(1 995), 1 54 (enclosed as D6) 

(ii) beta-glucosidases involved in hormone metabolism: 

- Brzobohatv et al., Release of active cytokinin by a beta-glucosidase 
localized to the maize root meristem. Science 262 (5136) (1993), 
1051-1054 (enclosed as D7) 

- Dietz et al., Extracellular beta-glucosidase activity in barley involved 
in the hydrolysis of ABA glucose conjugate in leaves. Journal of 
Experimental Botany 51(346) (2000), 937-944 (enclosed as D8) 

(iii) beta-glucosidases involved in secondary metabolism: 

- Dharmawardhana et al., A beta-Gucosidase from Lodgepole Pine 
Xylem Specific for the Lignin Precursor Coniferin. Plant Physiology 
1 07(2) (1 995), 331 -339 (enclosed as D9) 

- Dharmawardhana et al., cDNA cloning and heterologous expression 
of coniferin beta-glucosidase. Plant Molecular Biology 40(2) (1999), 
365-372 (enclosed as D10) 

Thus, a skilled person would not have expected that every beta-glucosidase 
he or she would choose is involved in plant resistance mechanisms. 

Moreover, even if one hypothetical ly assumes that the present invention is 
basically a selection among known beta-glucosidases, it has to be 
acknowledged that there is a high number of beta-glucosidases in plants. For 
instance, the Arabidopsis genome contains approximately 388 genes 
encoding glycosyl hydrolysases (see http://afmb.cnrs-mrs.fr/CAZY/) of which 
46 genes belong to family 1 to which PEN2 is also assigned (see Table 1 on 
page 6 of the description). Thus, even a selection among the known beta- 
glucosidases would have required inventive efforts. 
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3.4 Taking into account the technical problem mentioned above (section 3.2.3, 
supra), the claimed solution would not have been obvious in view of the cited 
prior art. In particular, none of these documents contain an indication that the 
disclosed beta-glucosidase may be of use in achieving broad resistance in 
plants. Thus, they would not have been taken into account by a skilled person 
when seeking an indication how the technical problem can be solved. Thus, 
the claimed solution would not have been obvious to be tried in view of D1 . 
D3,D3orD4. 

3.5 Moreover, the claimed solution would also not be obvious because none of the 
cited documents provides a basis for a reasonable expectation of success. As 
we have already pointed out above, the cited prior art documents are silent 
about establishing a broad resistance against plant pathogens. In addition, D3 
even points in the sentence on page 1013, bridging left and right column to an 
antagonist effect of the gene described in that document on pathogen versus 
insect resistance. Thus, D3, if at all, might present a basis for a negative 
expectation, i.e. that the beta-glucosidase which is described in D3 to be 
involved in resistance against insects might be negatively correlated with 
pathogen resistance. 

In addition, we note that non-obviousness of the present invention is further 
supported by the fact that the screening method by which the PEN2 knock-out 
mutant plant was identified (see section 3.2.1, supra) would not have been 
considered by a skilled person to achieve its goal. This is explained by the fact 
that nowhere in the prior art literature such a phenotypic screening approach 
is described or suggested. Thus, also because of this, a skilled person would 
not have had a reasonable expectation of success that the technical problem 
could be solved. 

Thus, the claimed solution would also not have been obvious because a 
skilled person would not have had a reasonable expectation of success. 

3.6 To conclude, we submit that a sound appreciation of the contribution of the 
invention to the art and the prior art knowledge leads to the conclusion that the 
claimed invention involves inventive step in view of Dl, D2, D3 and/or D4 
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because it would not have been obvious to try this solution and because there 
would not have been a reasonable expectation of success. 



With the above explanations and the proposed amendments to the claims, it is 
submitted that the Applicant has met the requirements as set forth in the Written 
Opinion. We thus request that a favorable International Preliminary Examination 
Report be issued in which novelty, inventive step and industrial applicability of all 
claims is acknowledged. 

If the Examiner cannot yet accept the new claims, a further written opinion or an 
informal communication pursuant to Rule 66 PCT is requested. 



Dr. Friederike Stolzenburg 
European Patent Attorney 

Enclosures 

New claims 1 to 26 
Exhibits 1 to 3 
Gerardi (D5) 
Leah (D6) 
Brzobohatv (D7) 
Dietz (D8) 

Dharmawardhana (D9) 
Dharmawardhana (D10) 
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CLAIMS 

1. A recombinant nucleic acid molecule comprising a polynucleotide selected 
from the group consisting of 

(a) polynucleotides comprising a nucleotide sequence encoding a 
polypeptide with the amino acid sequence of SEQ ID NO:2; 

(b) polynucleotides comprising the nucleotide sequence shown in SEQ ID 
NO:1; 

(c) polynucleotides comprising a nucleotide sequence encoding a fragment 
of the polypeptide encoded by a polynucleotide of (a) or (b), wherein 
said nucleotide sequence encodes a protein having b-glucosidase 
activity; 

(d) polynucleotides comprising a nucleotide sequence which encodes a 
polypeptide having a sequence identity of at least 50% to an amino acid 
sequence encoded by the polynucleotide of (a) or (b), wherein said 
polypeptide has 0-glucosidase activity; and 

(e) polynucleotides comprising a nucleotide sequence that deviates from 
the nucleotide sequence defined in (d) by the degeneracy of the genetic 
code; 

and a promoter operatively linked to said polynucleotide, said promoter being 
heterologous with respect to the polynucleotide. 

2. The recombinant nucleic acid molecule of claim 1 further comprising additional 
expression control sequences operably linked to said polynucleotide. 

3. A vector comprising the polynucleotide defined in claim 1 or the recombinant 
nucleic acid molecule of claim 1 or 2. 

4. The vector of claim 3 further comprising expression control sequences 
operably linked to said polynucleotide. 
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5. A method for producing genetically engineered host cells comprising 
introducing the polynucleotide defined in claim 1, the recombinant nucleic acid 
molecule of claim 1 or 2 or the vector of claim 3 or 4 into a host cell. 

6. A host cell which is genetically engineered with the polynucleotide defined in 
claim 1 , the recombinant nucleic acid molecule of claim 1 or 2 or the vector of 
claim 3 or 4 or obtainable by the method of claim 5. 

7. The host cell of claim 6 which is a bacterial, yeast, fungus, plant or animal cell. 

8. A method for the production of a polypeptide encoded by a polynucleotide 
defined in claim 1 in which the host cell of claim 6 or 7 is cultivated under 
conditions allowing for the expression of the polypeptide and in which the 
polypeptide is isolated from the cells and/or the culture medium. 

9. A polypeptide encoded by the polynucleotide defined in claim 1 or obtainable 
by the method of claim 8. 

1 0. An antibody specifically recognizing the polypeptide of claim 9. 

11. A method for producing a transgenic plant comprising the steps of 

(a) introducing the polynucleotide defined in claim 1, the recombinant 
nucleic acid molecule of claim 1 or 2 or the vector of claim 3 or 4 into 
the genome of a plant cell; and 

(b) regenerating the cell of (a) to a transgenic plant. 

12. A transgenic plant or plant tissue comprising the plant cells of claim 7 or 
obtainable by the method of claim 1 1 . 

13. A transgenic plant which shows an increased activity of the polypeptide 
encoded by the polynucleotide defined in claim 1 compared to a 
corresponding wild-type plant. 
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14. The transgenic plant of claim 12 or 13 which, upon an increased activity of the 
protein encoded by the polynucleotide defined in claim 1 , shows an increased 
resistance against a plant pathogen to which a corresponding wild-type plant 
is susceptible. 

1 5. Propagation material or harvestable parts of the transgenic plant of any one of 
claims 12 to 14 comprising plant cells of claim 7. 

16. A method for conferring pathogen resistance or increased pathogen resistance 
to a plant comprising the step of providing a transgenic plant in which the 
activity of the polypeptide encoded by the polynucleotide defined in claim 1 is 
increased. 

17. A method for identifying a compound that is hydrolyzed by a polypeptide 
encoded by the polynucleotide defined in claim 1 comprising the steps of 

(a) contacting a candidate compound with said polypeptide under 
conditions where said polypeptide is active; and 

(b) determining whether said candidate compound is hydrolyzed by said 
polypeptide. 

18. A method for identifying a compound that is hydrolyzed by a polypeptide 
encoded by the polynucleotide defined in claim 1 comprising the steps of 

(a) providing a 3-dimensional structure model of said polypeptide; and 

(b) determining the structure of a substrate that fits into the 3-dimensional 
structure model of (a). 

19. A method for identifying a compound that is hydrolyzed by a polypeptide 
encoded by the polynucleotide defined in claim 1 comprising the steps of 

(a) providing a mutant protein of said polypeptide the catalytic activity of 
which is abolished without losing substrate binding activity; 

(b) contacting a candidate compound with said mutant protein; and 

(c) determining whether the candidate compound is bound by said mutant 
protein. 



20. The method of any one of claims 17 to 19 furthermore comprising the step of 
determining whether the identified compound or a hydrolysis product thereof is 
capable of inducing or enhancing a defence response against a pathogen in a 
plant. 

21. A method for preparing a plant protection composition comprising the steps of 
the method of claims 17, 18, 19 and/or 20 and furthermore the step of 
formulating the identified compound or a hydrolysis product thereof in a form 
suitable for administering to plants. 

22. A compound, a hydrolysis product thereof or a plant protection composition 
identified or obtained by the method of any one of claims 16 to 21 . 

23. A kit comprising the polynucleotide defined in claim 1, the recombinant nucleic 
acid molecule of claim 1 or 2, the vector of claim 3 or 4, the polypeptide of 
claim 9, the antibody of claim 10 or the compound or a hydrolysis product 
thereof of claim 22. 

24. Use of the polynucleotide defined in claim 1, of the recombinant nucleic acid 
molecule of claim 1 or 2, of the vector of claim 3 or 4, of the host cell of claim 6 
or 7, of the polypeptide of claim 9, of the antibody of claim 1 0 or of the 
transgenic plant of any one of claims 12 to 15 for identifying a compound that 
is hydrolyzed by a polypeptide encoded by said polynucleotide. 

25. Use of the polynucleotide defined in claim 1 , of the recombinant nucleic acid 
molecule of claim 1 or 2, of the vector of claim 3 or 4, of the host cell of claim 6 
or 7 or of the polypeptide of claim 9, of the antibody of claim 10 or of the 
transgenic plant of any one of claims 12 to 15 for the preparation of a plant 
protection composition. 

26. Use of the polynucleotide defined in claim 1, of the recombinant nucleic acid 
molecule of claim 1 or 2, of the vector of claim 3 or 4, of the host cell of claim 6 
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or 7 or of the polypeptide of claim 9, of the antibody of claim 10 or of the 
transgenic plant of any one of claims 12 to 15 for establishing or enhancing a 
pathogen resistance in plants. 
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Polynucleotides encoding a beta-glucosidase and uses thereof 

The present invention relates to polynucleotides that encode a beta-glucosidase 
belonging to the plant protein class of family 1 glycoside hydrolases. The present 
invention furthermore relates to recombinant nucleic molecules and vectors 
containing these polynucleotides and to host cells, preferably plant cells, that are 
genetically engineered with these polynucleotides. In addition, the present invention 
relates to transgenic plants which show an increased activity of the beta-glucosidase 
of the invention, preferably leading to an improved resistance against plant 
pathogens. In another aspect, the present invention relates to screening methods for 
identifying a substrate for said beta-glucosidase and to the production of plant 
protection compositions from such substrates or hydrolysis products thereof. 
Furthermore, the present invention relates to methods and uses which apply the 
polynucleotides encoding the beta-glucosidase of the invention in order to improve 
resistance in plants against plant pathogens. 

In plants, a major form of resistance to disease caused by microbial pathogens is by 
expression of complementary gene pairs in the plant and pathogen, known 
respectively as resistance (R) and avirulence (avr) genes (Dangl and Jones, 2001). 
Direct or indirect interaction of their products triggers rapid cellular reprogramming 
that prevents pathogen colonization of the plant. Failure to express either one 
component results in disease susceptibility. The predominant class of plant R gene 
cloned from a diverse range of species encodes cytoplasmic proteins with the same 
modular structure as Nod proteins involved in animal innate immune responses 
(Dangl and Jones, 2001; Fluhr, 2001). 

Another form of plant defense is nonhost resistance, which describes the fact that an 
entire plant species is normally immune to a specific parasite (Heath, 2000). Because 
most plant species are immune to the majority of potentially pathogenic 
microorganisms, nonhost resistance most probably constitutes a major factor of 
resistance in the field. Also, nonhost resistance is known to be considerably more 
durable in nature compared to strain-specific R gene dependent resistance. Despite 



2 

its durable nature, nonhost resistance can depend on the recognition of a single 
pathogen molecule (e.g. recognition of the Phytophthora infestans Inf1 protein by 
Nicotiana benthamiana\ Kamoun et al., 1998). Generally, however, nonhost 
resistance is at present poorly understood at the molecular level. 
The Erysiphales, commonly known as the powdery mildews, are obligate biotrophic 
plant pathogenic fungi, belonging to the phylum Ascomycota. The fungi are the 
causal agent of the widespread powdery mildew disease. Powdery mildews were 
reported to infect at least 9,838 angiosperm plant species representing 1,617 genera 
(Amano, 1986). Remarkable is the host range variation of powdery mildew species. 
For example, the barley powdery mildew fungus (Blumeria graminis f sp horde/; Bgh) 
successfully colonizes barley but fails to infect even close relatives including wheat, 
rye, and oat. In contrast, Erysiphe cichoracearum has a much broader host range 
and infects several dicot plant species that belong to different families (e.g. 
Arabidopsis, squash, and tobacco). It is possible that host range is not determined by 
the absence/presence of static or inducible defense responses in a plant species but 
by lack of compatibility factors or absence of virulence factors in a powdery mildew 
species. 

Yet, the classic approaches of plant breeding focus on trying to introduce resistance 
traits from land races into commercially used plant varieties. This refers to genes 
conferring race-specific resistance as well as to non-host resistance. 
Since the advent of genetic engineering, many approaches have been made aiming 
at improving pest resistance in plants by producing transgenic plants that express 
recombinant gene constructs. For instance, it was tried to introduce pathogenesis- 
related protein genes into the plant genome in order to enhance the plant response 
to pathogen attack. However, for many pests, a sweeping success is still outstanding 
since one had to learn that plant-pathogen interactions are much more complex than 
previously imagined. Therefore, the introduction of one transgene often proved not to 
modify the complex regulatory network triggered upon infection in such a way that 
plant resistance was significantly improved. Thus, there is still a need to find effective 
tools for improving non-host resistance in plants. 
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Thus, the technical problem underlying the present invention is to provide means and 
methods that allow it to improve plants by establishing or enhancing a broad, 
preferably non-host resistance against plant pathogens. 

This technical problem is solved by the provision of the embodiments as 
characterized in the claims. 

Accordingly, the present invention relates to polynucleotides selected from the group 
consisting of 

(a) polynucleotides comprising a nucleotide sequence encoding a polypeptide 
with the amino acid sequence of SEQ ID NO:2; 

(b) polynucleotides comprising the nucleotide sequence shown in SEQ ID NO:1; 

(c) polynucleotides comprising a nucleotide sequence encoding a fragment of the 
polypeptide encoded by a polynucleotide of (a) or (b), wherein said nucleotide 
sequence encodes a protein having (3-glucosidase activity; 

(d) polynucleotides comprising a nucleotide sequence the complementary strand 
of which hybridizes to the polynucleotide of any one of (a) to (c), wherein said 
nucleotide sequence encodes a protein having p-glucosidase activity; and 

(e) polynucleotides comprising a nucleotide sequence that deviates from the 
nucleotide sequence defined in (d) by the degeneracy of the genetic code. 

Consequently, the present invention relates to polynucleotides encoding a 
polypeptide having beta-glucosidase activity, said polynucleotides preferably 
encoding a polypeptide comprising the amino acid sequence indicated in SEQ ID 
NO: 2. This polypeptide belongs to the protein class of family 1 glycoside hydrolases 
(Henrissat, Curr. Opin. Struct. Biol. 7 (1997), 637-644). 

The present invention is based on the finding that a mutant plant in which the gene 
encoding the polypeptide of the invention is inactivated due to the presence of a 
nonsense mutation shows a significant increase of susceptibility to non-host 
pathogens. 

This plant mutant, Arabidopsis pen2, has been produced in connection with the 
present invention by chemically mutagenizing Arabidopsis Col-3 plants (see Example 
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2). The M2 generation thereof has been phenotypically screened for an increase of 
penetration incidences. For this purpose, a screening method has been developed by 
which M2 plants were inoculated with barley powdery mildew (Bgh) and examined 
under stereomicroscope and UV light for sites of autofluorescence (Figure 2). Each 
site of autofluorescence was interpreted as a site of successful penetration by the 
pathogen resulting in a hypersensitive reaction that is recognizable by a release of 
polyphenolic compounds giving rise to autofluorescence. Thus, a significant increase 
of the amount of autofluorescence sites was indicative for a reduced ability of the 
plant to prevent pathogen penetration through the cell wall. A closer cytological 
analysis of the pen2 plants indeed revealed that the frequency of successful 
pathogen penetrations is about 30% of the incidents of pathogen attack which is a 
15-fold increase as compared to a corresponding frequency of about 2% in wild-type 
plants (see Example 2 and Figure 3). Interestingly, the capacity to ultimately 
successfully colonize the host plant remained unaltered in the pen2 mutant as 
compared to wild-type plants. Thus, the ability to counter pathogen penetration by a 
hypersensitive response, e.g. leading to programmed cell death, was not affected by 
the pen2 mutation. Based on these observations, it is concluded that the PEN2 
protein is involved in non-host resistance in Arabidopsis. Its function is exerted at an 
early stage of disease development and its site of action is, at least in part, the cell 
wall. 

Importantly, the defense of pen2 mutant plants is compromised with respect to 
different fungal pathogens (see Example 3). In response to challenges with 
Phytophthora infestans and wheat powdery mildew (Bgt), pen2 plants showed 
strikingly enhanced penetration frequencies and cell death incidents when compared 
to corresponding wild-type plants. Upon challenges with Pyricularia grisea, to which 
wild-type plants do not respond in any detectable way, pen2 plants show cell wall 
appositions at about 10% of the interaction sites. These results show that PEN2 is 
involved in non-host resistance to a broad scale of pathogens, at least to fungal 
pathogens of the taxonomic divisions oomycetes (P. infestans) and ascomycetes 
(Bgh, Bgt and P. grisea). In addition, it is evident that PEN2 has a dual role, namely 
playing part in static and in inducible defense. In P. grisea, the inactivation of PEN2 
leads to the break-down of the static cell wall barrier so that active defense 
mechanisms become necessary in order to counter pathogen attack. With respect to 
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Bgh, Bgt and P. infestans, additionally the inducible mechanisms are weakened in 
pen2 mutant plants. 

A further clue to the function of PEN2 has been revealed by analyzing the cell wall 
composition of uninfected wild-type and pen2 plants (see Example 4). The difference 
FTIR spectrum shown in Figure 7 indicates significant constitutive changes in the cell 
wall composition of pen2 plants in the carbohydrate and phenolics fingerprint. This 
additionally evidences that a site of action of PEN2 is the cell wall. 
By map-based cloning, the site of the pen2 mutation could be located to an open 
reading frame designated At2g44490 of chromosome 2 of the Arabidopsis genome 
(see Example 5). According to its structural characteristics, PEN2 can be assigned to 
the protein class of family 1 glycoside hydrolases. One such structural characteristic 
is the presence of the family 1 N-terminal signature (see Figure 9). Another 
characteristic is the presence of two glutamate residues at position 183 and 398 of 
SEQ ID NO:2. They lie within conserved sequence motifs which are described to be 
typical for family 1 glycoside hydrolases (TFNEP and (IA/)TENG; SEQ ID NOs:3 and 
4). In PEN2, the residue F is replaced by M which probably will not affect catalytic 
activity since both amino acids are apolar. The structural characteristics of PEN2 in 
comparison to other family 1 glycoside hydrolases are depicted in Table 1 and Figure 
12. In Czjzek (Proc. Natl. Acad. Sci. USA 97 (2000), 13555-13560) structural 
characteristics of family 1 glycoside hydrolases are summarized and are referred to 
herein. The glutamate residues E183 and E398 which are catalytically active are 
conserved throughout the family except for myrosinases which show Q and E. Amino 
acid residues in PEN2 which are predicted to be involved in determining substrate, 
preferably aglycone specificity are A190, K197, W370 and N459. This combination of 
amino acid residues is unique amongst all deduced Arabidopsis family 1 glycosyl 
hydrolases (Table 1), suggesting that PEN2 has a unique substrate specificity. 
Predicted residues responsible for glucose binding are: W450, E457 and W458. 

Table 1: Compilation of predicted catalytically active and aglycone specificity 
determining residues in all known 47 Arabidopsis family 1 glycosyl hydrolases 
including PEN2. cat. = catalytically active residues; agl. sp. = aglycone specificity 
determining residues; glue. Bind. = glucose binding residues (according to Czjzek et 
al., 2000). 
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The classification of PEN2 as a beta-glucosidase is based on its overall homology 
with the closest relatives in the protein family as well as on the presence of the N- 
terminal signature (Figure 9). In the GenBank/EMBL database entry AC004521, the 
pen2 ORF has already been identified as a putative beta-glucosidase. However, it 
was not yet clear whether this ORF is actually expressed, let alone that nothing was 
known on a crucial role of the PEN2 protein encoded by this ORF in non-host 
resistance. 

From the results discussed above, it is evident that the PEN2 protein is involved in 
the establishment of non-host resistance in plants. Thus, it can be expected that an 
increase of the activity of this protein in plants may establish or enhance resistance, 
preferably non-host resistance in said plants. 

In the context of the present invention, the term "pathogen" refers to organisms that 
attack plants. It includes, for example, bacteria, viruses, viroids, fungi and protozoa. 
Among fungal pathogens, species belonging to the taxonomic groups oomycota, 
ascomycetes and basidiomycetes (see for reference, e.g., Strasburger, Lehrbuch der 
Botanik, 33 rd edition, 1991, G. Fischer Verlag, Stuttgart, Jena, New-York) are 
pathogens of particular interest in the context of the present invention. Examples of 
important pathogens are Phytophthora infestans (the causal agent of potato late 
blight disease), Phytophthora sojae (root rot in soybean), Peronospora parasitica 
(downy mildew), Magnaporthe grisea (rice blast disease), Erysiphe spp (powdery 
mildew), Pseudomonas syringae (bacterial blight), Erwinia amylovora (fire blight 
disease), Erwinia carotovora (soft rot), Botrytis cinerea (downy mildew of grape), 
Rhizoctonia solani and Pythium debaryanum (agents of seedling blight or damping 
off disease). 

The term "susceptibility" refers to the capacity of a given pathogen to grow on or in 
the tissue of a plant. In particular, "susceptibility" refers to the growth of a pathogen 
on the epidermal surface and from there into the epidermis and subepidermal tissue, 
e.g. the mesophyll. Thus, the term "susceptibility" also covers incidents of pathogen 
attacks where the pathogen grows for a certain while on the host plant, however, 
without being capable to take up nutrients from the host and therefore without 
successfully colonising the host plant. In particular, successful colonization is 
characterized by completing that part of the pathogen's life cycle which takes place 
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on the plant host. With regard to fungal pathogens, like for instance powdery mildew, 
such a successful colonisation is for instance apparent from the formation of a 
haustorium and of secondary hyphae. 

The term "non-host" refers to plant-pathogen interactions between all individuals of 
one plant species and all individuals of one pathogen species, wherein none of such 
interactions lead to a successful colonisation of the pathogen on the plant. Thus, the 
term "non-host pathogen" refers to a pathogen species to which a certain plant 
species shows non-host resistance, i.e. all individuals of the plant species are 
resistant against all individuals of the pathogen species. 

In connection with the present invention, the term "resistant" or "resistance" refers to 
the property of a given plant or plant species to protect itself against an attack by a 
certain pathogen, whereby said protection may range from a delay to a complete 
inhibition of disease development. Preferably, "resistance" refers to an effective block 
of pathogen growth on or in said plant or plant species so that the pathogen is not 
able to successfully colonize the plant or plant species. Generally, resistance 
involves an interplay of various means that aim at blocking penetration of the 
pathogen into the plant. This may refer to static properties of the plant, i.e. structural, 
chemical or other characteristics of the plant that prevent or reduce pathogen 
penetration and which are constitutively present in the plant, i.e. independent of 
whether there is a pathogen attack or not. On the other hand, such means may also 
be inducible mechanisms with which a plant reacts to pathogen attack. Among these 
mechanisms, cell wall appositions, the expression of pathogen-related proteins and 
hypersensitive cell death are prominent examples. In connection with the present 
invention, resistance is preferably exerted at the level of cell wall penetration. This 
means that the provisions of the invention preferably improve or establish resistance 
against a pathogen by decreasing its capacity to overcome a cell wall barrier, which 
preferably is the outer cell wall of the epidermis or rhizodermis. It is furthermore 
preferred that the provisions of the invention improve or establish resistance against 
a pathogen by improving static defense (i.e. the capacity of a plant to prevent cell 
wall penetration by a pathogen without detectably inducing an inducible defense 
mechanism) and/or by improving one or more inducible defense mechanisms, in 
particular cell wall remodelling. The term "cell wall remodelling" refers to any 
structural changes of a cell wall that may take place in a plant as a response to 



pathogen attack such as cell wall apposition or encasement of parts of the pathogen 
such as a haustorium in material comprising callose. 

Furthermore, the term "improved resistance" refers to a significant reduction of 
susceptibility to a pathogen in plants treated according to the provisions of the 
present invention as compared to corresponding untreated plants. In particular, such 
a reduction of susceptibility may be evident from a significant reduction of penetration 
events and/or a significant reduction of hypersensitive reactions as for instance 
visible by fluorescence detection. Preferably, such a reduction of susceptibility of a 
plant so-treated is by at least 10%, more preferably at least 20%, still more preferably 
by at least 50%, even more preferably by at least 80% and most preferably to 
approximately 100% as compared to an untreated plant and with respect to the 
number of penetration events and/or hypersensitive reactions. The term "treated 
according to the provisions of the invention" refers to any means of enhancing the 
activity of the protein of the invention in the respective plant such as by 
overexpressing the protein in transgenic plants or by adding exogenous substrate. 
Said term may also refer to the administration of the enzymatic product of the protein 
of the invention which may give rise to an effect similar to directly enhancing the 
activity of the protein of the invention. 

The invention in particular relates to polynucleotides containing the nucleotide 
sequence indicated under SEQ ID NO: 1 or encoding the amino acid sequence 
shown under SEQ ID NO: 2 or a part thereof having beta-glucosidase activity. 
The term "beta-glucosidase activity" refers to the activities of the polypeptide with the 
amino acid sequence shown in SEQ ID NO:2. In particular, this term refers to the 
enzymatic activity acting on substrates containing at least one glycosidic bond such 
as for instance oligosaccharides, polysaccharides or conjugates between at least one 
glycosidic group (also called "glycone" or "carbohydrate") and a non-glycosidic moiety 
(also called "aglycone" or "non-carbohydrate"), whereby this bond is hydrolyzed by 
the hydrolytic activity of the polypeptide. The polypeptide of the invention belongs to 
the class of family 1 glycoside hydrolases (http://afmb.cnrs-mrs.fr/-cazy/CASY and 
Henrissat, Curr. Opin. Struct. Biol. 7 (1997), 637-644). Enzymes of this class either 
hydrolyze O-linked beta-glycosidic bonds (beta-D-glucoside glycohydrolase; EC 
3.2.1.21) or S-linked beta-glycosidic bonds (myrosinase or beta-D-thioglucoside 
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glycohydrolase; EC 3.2.3.1). The structural characteristics of the polypeptide of the 
invention are described above in connection with PEN2 and are depicted in Table 1. 
In addition, the activity of the polypeptide in plants also manifests itself in the cell wall 
composition. Thus, it may for instance also be determined whether the protein of the 
invention is expressed and active in a given plant tissue sample by comparing the 
cell wall composition of said sample with the cell wall composition from plant tissue 
where the protein of the invention is known to be active and/or with the cell wall 
composition from plant tissue where the protein of the invention is known not to be 
active. Corresponding techniques for analyzing the composition of cell walls are 
known by a person skilled in the art and include for instance Fourier transform 
infrared (FT-IR) microspectroscopy. By this technique, it is possible to detect 
significant differences of the cell wall composition, in particular with regard to 
carbohydrates and phenolics (see also Example 4). 

Moreover, the present invention relates to polynucleotides which encode a 
polypeptide having beta-glucosidase activity and the complementary strand of which 
hybridizes with a polynucleotide mentioned in sections (a) to (c), above. 
The present invention also relates to polynucleotides which encode a polypeptide, 
which has a homology, that is to say a sequence identity, of at least 30%, preferably 
of at least 40%, more preferably of at least 50%, even more preferably of at least 
60% and particularly preferred of at least 70%, especially preferred of at least 80% 
and even more preferred of at least 90% to the entire amino acid sequence as 
indicated in SEQ ID NO: 2, the polypeptide having beta-glucosidase activity. 
Moreover, the present invention relates to polynucleotides which encode a 
polypeptide having beta-glucosidase activity and the nucleotide sequence of which 
has a homology, that is to say a sequence identity, of at least 40%, preferably of at 
least 50%, more preferably of at least 60%, even more preferably of more than 65%, 
in particular of at least 70%, especially preferred of at least 80%, in particular of at 
least 90% and even more preferred of at least 95% when compared to the coding 
region of the sequence shown in SEQ ID NO: 1. 

It is particularly preferred that polynucleotides of the invention that encode a 
polypeptide having beta-glucosidase show the structural characteristics described 
above for PEN2 and depicted in Table 1 . 
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The present invention also relates to polynucleotides, which encode a polypeptide 
having beta-glucosidase activity and the sequence of which deviates from the 
nucleotide sequences of the above-described polynucleotides due to the degeneracy 
of the genetic code. 

The invention also relates to polynucleotides comprising a nucleotide sequence 
which is complementary to the whole or a part of one of the above-mentioned 
sequences. 

In the context of the present invention the term "hybridization" means hybridization 
under conventional hybridization conditions, preferably under stringent conditions, as 
for instance described in Sambrook and Russell (2001), Molecular Cloning: A 
Laboratory Manual, CSH Press, Cold Spring Harbor, NY, USA. In an especially 
preferred embodiment, the term "hybridization" means that hybridization occurs 
under the following conditions: 

Hybridization buffer: 2 x SSC; 10 x Denhardt solution (Fikoll 400 + PEG + 

BSA; ratio 1:1:1); 0.1% SDS; 5 mM EDTA; 50 mM 
Na 2 HP0 4; 

250 |xg/ml of herring sperm DNA; 50 jig/ml of tRNA; 
or 

0.25 M of sodium phosphate buffer, pH 7.2; 
1 mM EDTA 
7% SDS 

Hybridization temperature T = 60°C 

Washing buffer: 2 x SSC; 0.1 % SDS 

Washing temperature T = 60°C. 

Polynucleotides which hybridize with the polynucleotides of the invention can, in 
principle, encode a polypeptide having beta-glucosidase activity from any organism 
expressing such polypeptides or can encode modified versions thereof. 
Polynucleotides which hybridize with the polynucleotides disclosed in connection with 
the invention can for instance be isolated from genomic libraries or cDNA libraries of 
bacteria, fungi, plants or animals. Preferably, such polynucleotides are from plant 



origin, particularly preferred from a plant belonging to the dicotyledons, more 
preferably from the family of Brassicaceae. Preferably, the polynucleotide of the 
invention is not a polynucleotide with or comprising the nucleotide sequence shown 
in SEQ ID NO:1, and is a variant of such a polynucleotide as described herein. 
Preferably, the polynucleotide of the invention is a variant, preferably an ortholog of a 
polynucleotide comprising SEQ ID NO:1 and may for example comprise a nucleotide 
sequence that originates from an agronomically important crop species such as from 
rape seed or cabbage varieties. Alternatively, such polynucleotides can be prepared 
by genetic engineering or chemical synthesis. 

Such hybridizing polynucleotides may be identified and isolated by using the 
polynucleotides described hereinabove or parts or reverse complements thereof, for 
instance by hybridization according to standard methods (see for instance Sambrook 
and Russell (2001), Molecular Cloning: A Laboratory Manual, CSH Press, Cold 
Spring Harbor, NY, USA). Polynucleotides comprising the same or substantially the 
same nucleotide sequence as indicated in SEQ ID NO: 1 or parts thereof can, for 
instance, be used as hybridization probes. The fragments used as hybridization 
probes can also be synthetic fragments which are prepared by usual synthesis 
techniques, and the sequence of which is substantially identical with that of a 
polynucleotide according to the invention. 

The molecules hybridizing with the polynucleotides of the invention also comprise 
fragments, derivatives and allelic variants of the above-described polynucleotides 
encoding a polypeptide having beta-glucosidase activity. Herein, fragments are 
understood to mean parts of the polynucleotides which are long enough to encode 
the described polypeptide, preferably showing the biological activity of a polypeptide 
of the invention as described above. In this context, the term derivative means that 
the sequences of these molecules differ from the sequences of the above-described 
polynucleotides in one or more positions and show a high degree of homology to 
these sequences, preferably within the preferred ranges of homology mentioned 
above. 

Preferably, the degree of homology is determined by comparing the respective 
sequence with the nucleotide sequence of the coding region of SEQ ID NO: 1. When 
the sequences which are compared do not have the same length, the degree of 
homology preferably refers to the percentage of nucleotide residues in the shorter 
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sequence which are identical to nucleotide residues in the longer sequence. The 
degree of homology can be determined conventionally using known computer 
programs such as the DNASTAR program with the ClustalW analysis. This program 
can be obtained from DNASTAR, Inc., 1228 South Park Street, Madison, Wl 53715 
or from DNASTAR, Ltd., Abacus House, West Ealing, London W13 OAS UK 
(support@dnastar.com) and is accessible at the server of the EMBL outstation. 
When using the Clustal analysis method to determine whether a particular sequence 
is, for instance, 80% identical to a reference sequence the settings are preferably as 
follows: Matrix: blosum 30; Open gap penalty: 10.0; Extend gap penalty: 0.05; Delay 
divergent: 40; Gap separation distance: 8 for comparisons of amino acid sequences. 
For nucleotide sequence comparisons, the Extend gap penalty is preferably set to 
5.0. 

Preferably, the degree of homology of the hybridizing polynucleotide is calculated 
over the complete length of its coding sequence. It is furthermore preferred that such 
a hybridizing polynucleotide, and in particular the coding sequence comprised 
therein, has a length of at least 300 nucleotides, preferably at least 500 nucleotides, 
more preferably of at least 750 nucleotides, even more preferably of at least 1000 
nucleotides, particularly preferred of at least 1500 nucleotides and most preferably of 
at least 2000 nucleotides. 

Preferably, sequences hybridizing to a polynucleotide according to the invention 
comprise a region of homology of at least 90%, preferably of at least 93%, more 
preferably of at least 95%, still more preferably of at least 98% and particularly 
preferred of at least 99% identity to an above-described polynucleotide, wherein this 
region of homology has a length of at least 500 nucleotides, more preferably of at 
least 750 nucleotides, even more preferably of at least 1000 nucleotides, particularly 
preferred of at least 1500 nucleotides and most preferably of at least 2000 
nucleotides. 

Homology, moreover, means that there is a functional and/or structural equivalence 
between the corresponding polynucleotides or polypeptides encoded thereby. 
Polynucleotides which are homologous to the above-described molecules and 
represent derivatives of these molecules are normally variations of these molecules 
which represent modifications having the same biological function. They may be 
either naturally occurring variations, for instance sequences from other ecotypes, 
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varieties, species, etc., or mutations, and said mutations may have formed naturally 
or may have been produced by deliberate mutagenesis. Furthermore, the variations 
may be synthetically produced sequences. The allelic variants may be naturally 
occurring variants or synthetically produced variants or variants produced by 
recombinant DNA techniques. Deviations from the above-described polynucleotides 
may have been produced, e.g., by deletion, substitution, insertion and/or 
recombination. 

The polypeptides encoded by the different variants of the polynucleotides of the 
invention possess certain characteristics they have in common. These include for 
instance biological activity, molecular weight, immunological reactivity, conformation, 
etc., and physical properties, such as for instance the migration behavior in gel 
electrophoreses, chromatographic behavior, sedimentation coefficients, solubility, 
spectroscopic properties, stability, pH optimum, temperature optimum etc. 
The biological activity of a polypeptide of the invention, in particular the capacity to 
hydrolyze the glycosidic bond in a substrate for which it is specific can be tested in 
conventional enzyme assays using the substrate of the polypeptide or a suitable 
modified form thereof. 

The invention also relates to oligonucleotides specifically hybridizing to a 
polynucleotide of the invention. Such oligonucleotides have a length of preferably at 
least 10, in particular at least 15, and particularly preferably of at least 50 
nucleotides. Advantageously, their length does not exceed a length of 1000, 
preferably 500, more preferably 200, still more preferably 100 and most preferably 50 
nucleotides. They are characterized in that they specifically hybridize to the 
polynucleotides of the invention, that is to say that they do not or only to a very minor 
extent hybridize to nucleic acid sequences encoding another beta-glucosidase. The 
oligonucleotides of the invention can be used for instance as primers for amplification 
techniques such as the PCR reaction or as a hybridization probe to isolate related 
genes. The hybridization conditions and homology values described above in 
connection with the polynucleotide encoding a polypeptide having beta-glucosidase 
activity may likewise apply in connection with the oligonucleotides mentioned herein. 
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The polynucleotides of the invention can be DNA molecules, in particular genomic 
DNA or cDNA. Moreover, the polynucleotides of the invention may be RNA 
molecules. The polynucleotides of the invention can be obtained for instance from 
natural sources or may be produced synthetically or by recombinant techniques, 
such as PCR. 

In another aspect, the present invention relates to recombinant nucleic acid 
molecules comprising the polynucleotide of the invention described above. The term 
"recombinant nucleic acid molecule" refers to a nucleic acid molecule which contains 
in addition to a polynucleotide of the invention as described above at least one 
further heterologous coding or non-coding nucleotide sequence. The term 
"heterologous" means that said polynucleotide originates from a different species or 
from the same species, however, from another location in the genome than said 
added nucleotide sequence. The term "recombinant" implies that nucleotide 
sequences are combined into one nucleic acid molecule by the aid of human 
intervention. The recombinant nucleic acid molecule of the invention can be used 
alone or as part of a vector. 

For instance, the recombinant nucleic acid molecule may encode the polypeptide 
having beta-glucosidase activity fused to a marker sequence, such as a peptide, 
which facilitates purification of the fused polypeptide. The marker sequence may for 
example be a hexa-histidine peptide, such as the tag provided in a pQE vector 
(Qiagen, Inc.) which provides for convenient purification of the fusion polypeptide. 
Another suitable marker sequence may be the HA tag which corresponds to an 
epitope derived from influenza hemagglutinin polypeptide (Wilson, Cell 37 (1984), 
767). As a further example, the marker sequence may be glutathione-S-transf erase 
(GST) which, apart from providing a purification tag, enhances polypeptide stability, 
for instance, in bacterial expression systems. 

In a preferred embodiment, the recombinant nucleic acid molecules further comprise 
expression control sequences operably linked to the polynucleotide comprised by the 
recombinant nucleic acid molecule, more preferably these recombinant nucleic acid 
molecules are expression cassettes. The term "operatively linked", as used 
throughout the present description, refers to a linkage between one or more 
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expression control sequences and the coding region in the polynucleotide to be 
expressed in such a way that expression is achieved under conditions compatible 
with the expression control sequence. 

Expression comprises transcription of the heterologous DNA sequence, preferably 
into a translatable mRNA. Regulatory elements ensuring expression in prokaryotic as 
well as in eukaryotic cells, preferably in plant cells, are well known to those skilled in 
the art. They encompass promoters, enhancers, termination signals, targeting signals 
and the like. Examples are given further below in connection with explanations 
concerning vectors. In the case of eukaryotic cells, expression control sequences 
may comprise poly-A signals ensuring termination of transcription and stabilization of 
the transcript, for example, those of the 35S RNA from Cauliflower Mosaic Virus 
(CaMV) or the nopaline synthase gene from Agrobacterium tumefaciens. Additional 
regulatory elements may include transcriptional as well as translational enhancers. A 
plant translational enhancer often used is the CaMV omega sequences. Similarly, the 
inclusion of an intron (e.g. intron-1 from the shrunken gene of maize) has been 
shown to increase expression levels by up to 100-fold (Mait, Transgenic Research 6 
(1997), 143-156; Ni, Plant Journal 7 (1995), 661-676). 

Moreover, the invention relates to vectors, in particular plasmids, cosmids, viruses, 
bacteriophages and other vectors commonly used in genetic engineering, which 
contain the above-described polynucleotides of the invention. In a preferred 
embodiment of the invention, the vectors of the invention are suitable for the 
transformation of fungal cells, cells of microorganisms such as yeast or bacterial 
cells, animal cells or, in particular, plant cells. In a particularly preferred embodiment 
such vectors are suitable for stable transformation of plants. 

In another preferred embodiment, the vectors further comprise expression control 
sequences operably linked to said polynucleotides contained in the vectors. These 
expression control sequence may be suited to ensure transcription and synthesis of a 
translatable RNA in prokaryotic or eukaryotic cells. 

The expression of the polynucleotides of the invention in prokaryotic or eukaryotic 
cells, for instance in Escherichia co//, is interesting because it permits a more precise 
characterization of the biological activities of the encoded polypeptide. In particular, 
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recombinantly expressed polypeptide may be used to identify substrate compounds 
that are hydrolyzed by its activity. Moreover, it is possible to express these 
polypeptides in such prokaryotic or eukaryotic cells which are free from interfering 
polypeptides. In addition, it is possible to insert different mutations into the 
polynucleotides by methods usual in molecular biology (see for instance Sambrook 
and Russell (2001), Molecular Cloning: A Laboratory Manual, CSH Press, Cold 
Spring Harbor, NY, USA), leading to the synthesis of polypeptides possibly having 
modified biological properties. In this regard it is on the one hand possible to produce 
deletion mutants in which polynucleotides are produced by progressive deletions 
from the 5' or 3' end of the coding DNA sequence, and said polynucleotides lead to 
the synthesis of correspondingly shortened polypeptides. 

On the other hand, the introduction of point mutations is also conceivable at positions 
at which a modification of the amino acid sequence for instance influences the 
biological activity or the regulation of the polypeptide. 

Moreover, mutants possessing a modified substrate or product specificity can be 
prepared. Furthermore, it is possible to prepare mutants having a modified activity- 
temperature-profile. Preferably, such mutants show an increased activity. 
Alternatively, mutants can be prepared the catalytic activity of which is abolished 
without loosing substrate binding activity. Such a mutant may for example be 
produced by substituting the amino acid residue corresponding to the glutamate 
residue at position 183 of SEQ ID NO: 2 by another amino acid residue, e.g. an 
aspartate residue. This mutagenesis may be earned out as for instance described in 
Czjzek (Proc. Natl. Sci. USA 97 (2000), 13555-13560). Such inactivated but still 
binding mutants may be useful for screening for substrate compounds by the so- 
called substrate trap technique. 

Furthermore, in the case of expression in plants, plant tissue or plant cells, the 
introduction of mutations into the polynucleotides of the invention allows the gene 
expression rate and/or the activity of the polypeptides encoded by the 
polynucleotides of the invention to be reduced or increased. 

For genetic engineering in prokaryotic cells, the polynucleotides of the invention or 
parts of these molecules can be introduced into plasmids which permit mutagenesis 
or sequence modification by recombination of DNA sequences. Standard methods 
(see Sambrook and Russell (2001), Molecular Cloning: A Laboratory Manual, CSH 
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Press, Cold Spring Harbor, NY, USA) allow base exchanges to be performed or 
natural or synthetic sequences to be added. DNA fragments can be connected to 
each other by applying adapters and linkers to the fragments. Moreover, engineering 
measures which provide suitable restriction sites or remove surplus DNA or 
restriction sites can be used. In those cases, in which insertions, deletions or 
substitutions are possible, in vitro mutagenesis, "primer repair", restriction or ligation 
can be used. In general, a sequence analysis, restriction analysis and other methods 
of biochemistry and molecular biology are carried out as analysis methods. 

Additionally, the present invention relates to a method for producing genetically 
engineered host cells comprising introducing the above-described polynucleotides, 
recombinant nucleic acid molecules or vectors of the invention into a host cell. 

Another embodiment of the invention relates to host cells, in particular prokaryotic or 
eukaryotic cells, genetically engineered with the above-described polynucleotides, 
recombinant nucleic acid molecules or vectors of the invention or obtainable by the 
above-mentioned method for producing genetically engineered host cells, and to 
cells derived from such transformed cells and containing a polynucleotide, 
recombinant nucleic acid molecule or vector of the invention. In a preferred 
embodiment the host cell is genetically modified in such a way that it contains a 
polynucleotide stably integrated into the genome. Preferentially, the host cell of the 
invention is a bacterial, yeast, fungus, plant or animal cell. 

More preferably the polynucleotide can be expressed so as to lead to the production 
of a polypeptide having beta-glucosidase activity. An overview of different expression 
systems is for instance contained in Methods in Enzymology 153 (1987), 385-516, in 
Bitter et al. (Methods in Enzymology 153 (1987), 516-544) and in Sawers et al. 
(Applied Microbiology and Biotechnology 46 (1996), 1-9), Billman-Jacobe (Current 
Opinion in Biotechnology 7 (1996), 500-4), Hockney (Trends in Biotechnology 12 
(1994), 456-463), Griffiths et al., (Methods in Molecular Biology 75 (1997), 427-440). 
An overview of yeast expression systems is for instance given by Hensing et al. 
(Antonie van Leuwenhoek 67 (1995), 261-279), Bussineau et al. (Developments in 
Biological Standardization 83 (1994), 13-19), Gellissen et al. (Antonie van 
Leuwenhoek 62 (1992), 79-93, Fleer (Current Opinion in Biotechnology 3 (1992), 
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486-496), Vedvick (Current Opinion in Biotechnology 2 (1991), 742-745) and 
Buckholz (Bio/Technology 9 (1991), 1067-1072). 

Expression vectors have been widely described in the literature. As a rule, they 
contain not only a selection marker gene and a replication-origin ensuring replication 
in the host selected, but also a bacterial or viral promoter, and in most cases a 
termination signal for transcription. Between the promoter and the termination signal 
there is in general at least one restriction site or a polylinker which enables the 
insertion of a coding DNA sequence. The DNA sequence naturally controlling the 
transcription of the corresponding gene can be used as the promoter sequence, if it 
is active in the selected host organism. However, this sequence can also be 
exchanged for other promoter sequences. It is possible to use promoters ensuring 
constitutive expression of the gene and inducible promoters which permit a deliberate 
control of the expression of the gene. Bacterial and viral promoter sequences 
possessing these properties are described in detail in the literature. Regulatory 
sequences for the expression in microorganisms (for instance E. coli, S. cerevisiae) 
are sufficiently described in the literature. Promoters permitting a particularly high 
expression of a downstream sequence are for instance the T7 promoter (Studier et 
al., Methods in Enzymology 185 (1990), 60-89), lacUVS, trp, trp-lacUV5 (DeBoer et 
al., in Rodriguez and Chamberlin (Eds), Promoters, Structure and Function; Praeger, 
New York, (1982), 462-481; DeBoer et al., Proc. Natl. Acad. Sci. USA (1983), 21-25), 
Ip1, rac (Boros et al., Gene 42 (1986), 97-100). Inducible promoters are preferably 
used for the synthesis of polypeptides. These promoters often lead to higher 
polypeptide yields than do constitutive promoters. In order to obtain an optimum 
amount of polypeptide, a two-stage process is often used. First, the host cells are 
cultured under optimum conditions up to a relatively high cell density. In the second 
step, transcription is induced depending on the type of promoter used. In this regard, 
a tac promoter is particularly suitable which can be induced by lactose or IPTG 
(=isopropyl-fi-D-thiogalactopyranoside) (deBoer et al., Proc. Natl. Acad. Sci. USA 80 
(1983), 21-25). Termination signals for transcription are also described in the 
literature. 

The transformation of the host cell with a polynucleotide or vector according to the 
invention can be carried out by standard methods, as for instance described in 
Sambrook and Russell (2001), Molecular Cloning: A Laboratory Manual, CSH Press, 
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Cold Spring Harbor, NY, USA; Methods in Yeast Genetics, A Laboratory Course 
Manual, Cold Spring Harbor Laboratory Press, 1990. The host cell is cultured in 
nutrient media meeting the requirements of the particular host cell used, in particular 
in respect of the pH value, temperature, salt concentration, aeration, antibiotics, 
vitamins, trace elements etc. The polypeptide according to the present invention can 
be recovered and purified from recombinant cell cultures by methods including 
ammonium sulfate orethanol precipitation, acid extraction, anion or cation exchange 
chromatography, phosphocellulose chromatography, hydrophobic interaction 
chromatography, affinity chromatography, hydroxylapatite chromatography and lectin 
chromatography. Polypeptide refolding steps can be used, as necessary, in 
completing configuration of the polypeptide. Finally, high performance liquid 
chromatography (HPLC) can be employed for final purification steps. 

Accordingly, the present invention also relates to a method for the production of a 
polypeptide encoded by a polynucleotide of the invention as described above in 
which the above-mentioned host cell is cultivated under conditions allowing for the 
expression of the polypeptide and in which the polypeptide is isolated from the cells 
and/or the culture medium. 

Moreover, the invention relates to a polypeptide which is encoded by a 
polynucleotide according to the invention or obtainable by the above-mentioned 
method for the production of a polypeptide encoded by a polynucleotide of the 
invention. 

The polypeptide of the present invention may, e.g., be a naturally purified product or 
a product of chemical synthetic procedures or produced by recombinant techniques 
from a prokaryotic or eukaroytic host (for example, by bacterial, yeast, higher plant, 
insect and mammalian cells in culture). Depending upon the host employed in a 
recombinant production procedure, the polypeptide of the present invention may be 
glycosylated or may be non-glycosylated. The polypeptide of the invention may also 
include an initial methionine amino acid residue. The polypeptide according to the 
invention may be further modified to contain additional chemical moieties not 
normally part of the polypeptide. Those derivatized moieties may, e.g., improve the 
stability, solubility, the biological half life or absorption of the polypeptide. The 
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moieties may also reduce or eliminate any undesirable side effects of the polypeptide 
and the like. An overview for these moieties can be found, e.g., in Remington's 
Pharmaceutical Sciences (18 th ed., Mack Publishing Co., Easton, PA (1990)). 
Polyethylene glycol (PEG) is an example for such a chemical moiety which has been 
used for the preparation of therapeutic polypeptides. The attachment of PEG to 
polypeptides has been shown to protect them against proteolysis (Sada et al., J. 
Fermentation Bioengineering 71 (1991), 137-139). Various methods are available for 
the attachment of certain PEG moieties to polypeptides (for review see: Abuchowski 
et al., in "Enzymes as Drugs"; Holcerberg and Roberts, eds. (1981), 367-383). 
Generally, PEG molecules are connected to the polypeptide via a reactive group 
found on the polypeptide. Amino groups, e.g. on lysines or the amino terminus of the 
polypeptide are convenient for this attachment among others. 

Furthermore, the present invention also relates to an antibody specifically 
recognizing a polypeptide according to the invention. The antibody can be 
monoclonal or polyclonal and can be prepared according to methods well known in 
the art. The term "antibody" also comprises fragments of an antibody which still retain 
the binding specificity. 

The polypeptide according to the invention, its fragments or other derivatives thereof, 
or cells expressing them can be used as an immunogen to produce antibodies 
thereto. The present invention in particular also includes chimeric, single chain, and 
humanized antibodies, as well as Fab fragments, or the product of an Fab expression 
library. Various procedures known in the art may be used for the production of such 
antibodies and fragments. 

Antibodies directed against a polypeptide according to the present invention can be 
obtained, e.g., by direct injection of the polypeptide into an animal or by administering 
the polypeptide to an animal, preferably a non-human animal. The antibody so 
obtained will then bind the polypeptide itself. In this manner, even a sequence 
encoding only a fragment of the polypeptide can be used to generate antibodies 
binding the whole native polypeptide. Such antibodies can then, e.g., be used to 
isolate the polypeptide from tissue expressing that polypeptide or to detect it in a 
probe. For the preparation of monoclonal antibodies, any technique which provides 
antibodies produced by continuous cell line cultures can be used. Examples for such 
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techniques include the hybridoma technique (Kohler and Milstein, Nature 256 (1975), 
495-497), the trioma technique, the human B-cell hybridoma technique (Kozbor et al., 
Immunology Today 4 (1983), 72) and the EBV-hybridoma technique to produce 
human monoclonal antibodies (Cole et al., Monoclonal Antibodies and Cancer 
Therapy, Alan R. Liss, Inc. (1985), 77-96). Techniques describing the production of 
single chain antibodies (e.g., U.S. Patent 4,946,778) can be adapted to produce 
single chain antibodies to immunogenic polypeptides according to the present 
invention. Furthermore, transgenic mice may be used to express humanized 
antibodies directed against immunogenic polypeptides of the present invention. 

Furthermore, the invention relates to a method for producing a transgenic plant 
comprising the steps of 

(a) introducing at least one of the above-described polynucleotides, recombinant 
nucleic acid molecules or vectors of the invention into the genome of a plant 
cell; and 

(b) regenerating the cell of (a) to a transgenic plant. 

Optionally, the method may further comprise step (c) producing progeny from the 
plants produced in step (b). 

In a further aspect, the invention relates to transgenic plants or plant tissue 
comprising plant cells which are genetically engineered with the polynucleotide of the 
invention or which contain the recombinant nucleic acid molecule or the vector of the 
invention or to transgenic plants obtainable by the method mentioned above. 
Preferably, in the transgenic plant of the invention, the polynucleotide of the invention 
is expressed at least in one part, i.e. organ, tissue or cell type, of the plant. 
Preferably, this expression leads to an increase of beta-glucosidase activity in the 
cells which express said polynucleotide or in the environment of such cells, e.g. in 
the apoplast, in particular in the cell wall. Increase of activity can be detected for 
instance by measuring the amount of transcript and/or protein in the transformed cell, 
tissue or plant in comparison to corresponding measurements at non-transformed 
plant cells, tissue or plants. According to the teachings of the present invention, an 
increase of the activity of the polypeptide of the invention in transgenic plants leads 
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to an increase of resistance against a plant pathogen to which a corresponding wild- 
type plant is susceptible or at least more susceptible. 

The polynucleotide introduced into the transgenic plant can in principle be expressed 
in all or substantially all cells of the plant However, it is also possible that it is only 
expressed in certain parts, organs, cell types, tissues etc. Moreover, it is possible that 
expression of the polynucleotide only takes place upon induction, at a certain 
developmental stage or, as it may be preferred in some embodiments, upon 
pathogen attack. In a preferred embodiment, the polynucleotide is expressed in those 
parts of the plant that are exposed to pathogen attack, for example the epidermis or 
the rhizodermis. 

In order to be expressed, the polynucleotide that is introduced into a plant cell is 
preferably operatively linked to one or more expression control sequences, e.g. a 
promoter, active in this plant cell. 

The promoter may be homologous or heterologous with regard to its origin and/or 
with regard to the gene to be expressed. Suitable promoters are for instance the 
promoter of the 35S RNA of the Cauliflower Mosaic Virus (see for instance US-A 
5,352,605) and the ubiquitin-promoter (see for instance US-A 5,614,399) which lend 
themselves to constitutive expression, the patatin gene promoter B33 (Rocha-Sosa 
et al., EMBO J. 8 (1989), 23-29) which lends itself to a tuber-specific expression in 
potatoes or a promoter ensuring expression in photosynthetically active tissues only, 
for instance the ST-LS1 promoter (Stockhaus et al., Proc. Natl. Acad. Sci. USA 84 
(1987), 7943-7947; Stockhaus et al., EMBO, J. 8 (1989) 2445-2451), the Ca/b- 
promoter (see for instance US-A-5,656,496, US-A-5,639,952, Bansal et al., Proc. 
Natl. Acad. Sci. USA 89 (1992), 3654-3658) and the Rubisco SSU promoter (see for 
instance US-A-5,034,322; US-A-4,962,028) or the glutelin promoter from wheat 
which lends itself to endosperm-specific expression (HMW promoter) (Anderson, 
Theoretical and Applied Genetics 96, (1998), 568-576, Thomas, Plant Cell 2 (12), 
(1990), 1171-1180), the glutelin promoter from rice (Takaiwa, Plant Mol. Biol. 30(6) 
(1996), 1207-1221, Yoshihara, FEBS Lett. 383 (1996), 213-218, Yoshihara, Plant 
and Cell Physiology 37 (1996), 107-111), the shrunken promoter from maize (Maas, 
EMBO J. 8 (11) (1990), 3447-3452, Werr, Mol. Gen. Genet. 202(3) (1986), 471-475, 
Werr, Mol. Gen. Genet. 212(2), (1988), 342-350), the USP promoter, the phaseolin 
promoter (Sengupta-Gopalan, Proc. Natl. Acad. Sci. USA 82 (1985), 3320-3324, 
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Bustos, Plant Cell 1 (9) (1989), 839-853) or promoters of zein genes from maize 
(Pedersen et al., Cell 29 (1982), 1015-1026; Quatroccio et al., Plant Mol. Biol. 15 
(1990), 81-93). However, promoters which are only activated at a point in time 
determined by external influences can also be used (see for instance WO 93/07279). 
In this connection, promoters of heat shock proteins which permit simple induction 
may be of particular interest. Likewise, artificial and/or chemically inducible promoters 
may be used in this context. Moreover, seed-specific promoters such as the USP 
promoter from Vicia faba which ensures a seed-specific expression in Vicia faba and 
other plants may be used (Fiedler et al., Plant Mol. Biol. 22 (1993), 669-679; 
Baumlein et al., Mol. Gen. Genet. 225 (1991), 459-467). Moreover, fruit-specific 
promoters, such as described in WO 91/01373 may be used too. In one embodiment, 
promoters which ensure constitutive expression are preferred. However, in another 
preferred embodiment, the polynucleotide may be operatively linked to a promoter 
which is inducible upon pathogen attack. 

Moreover, the polynucleotide may be linked to a termination sequence which serves 
to terminate transcription correctly and to add a poly-A-tail to the transcript which is 
believed to have a function in the stabilization of the transcripts. Such elements are 
described in the literature (see for instance Gielen et al., EMBO J. 8 (1989), 23-29) 
and can be replaced at will. 

Furthermore, if needed, polypeptide expression can in principle be targeted to any 
sub-localization of plant cells (e.g. cytosol, plastids, vacuole, mitochondria) or the 
plant (e.g. apoplast). In order to achieve the localization in a particular compartment, 
the coding region to be expressed may be linked to DNA sequences encoding a 
signal sequence (also called "transit peptide") ensuring localization in the respective 
compartment. It is evident that these DNA sequences are to be arranged in the same 
reading frame as the coding region to be expressed. Preferably, signal sequences 
directing expression into the apoplast are used in connection with the present 
invention. 

In order to ensure the location in the plastids it is conceivable to use one of the 
following transit peptides: of the plastidic Ferredoxin: NADP+ oxidoreductase (FNR) 
of spinach which is enclosed in Jansen et al. (Current Genetics 13 (1988), 517-522). 
In particular, the sequence ranging from nucleotides -171 to 165 of the cDNA 
sequence disclosed therein can be used which comprises the 5' non-translated 
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region as well as the sequence encoding the transit peptide. Another example is the 
transit peptide of the waxy protein of maize including the first 34 amino acid residues 
of the mature waxy protein (Klosgen et al., Mol. Gen. Genet. 217 (1989), 155-161). It 
is also possible to use this transit peptide without the first 34 amino acids of the 
mature protein. Furthermore, the signal peptides of the ribulose bisphosphate 
carboxylase small subunit (Wolter et al., Proc. Natl. Acad. Sci. USA 85 (1988), 846- 
850; Nawrath et al.. Proc. Natl. Acad. Sci. USA 91 (1994), 12760-12764), of the 
NADP malat dehydrogenase (Gallardo et al., Planta 197 (1995), 324-332), of the 
glutathione reductase (Creissen et al., Plant J. 8 (1995), 167-175) or of the R1 
protein (Lorberth et al. Nature Biotechnology 16, (1998), 473-477) can be used. 
In order to ensure the location in the vacuole, it is conceivable to use one of the 
following transit peptides: the N-terminal sequence (146 amino acids) of the patatin 
protein (Sonnewald et al., Plant J. 1 (1991), 95-106) or the signal sequences 
described by Matsuoka and Neuhaus (Journal of Experimental Botany 50 (1999), 
165-174); Chrispeels and Raikhel (Cell 68 (1992), 613-616); Matsuoka and 
Nakamura (Proc. Natl. Acad. Sci. USA 88 (1991), 834-838); Bednarek and Raikhel 
(Plant Cell 3 (1991), 1195-1206); and Nakamura and Matsuoka (Plant Phys. 101 
(1993), 1-5). 

In order to ensure the localization in the mitochondria, it is for example conceivable to 
use the transit peptide described by Braun (EMBO J. 1 1 , (1992), 3219-3227). 
In order to ensure the localization in the apoplast, it is conceivable to use one of the 
following transit peptides: signal sequence of the proteinase inhibitor ll-gene (Keil et 
al., Nucleic Acid Res. 14 (1986), 5641-5650; von Schaewen et al., EMBO J. 9 (1990), 
30-33), of the levansucrase gene from Erwinia amylovora (Geier and Geider, Phys. 
Mol. Plant Pathol. 42 (1993), 387-404), of a fragment of the patatin gene B33 from 
Solanum tuberosum, which encodes the first 33 amino acids (Rosahl et al., Mol Gen. 
Genet. 203 (1986), 214-220) or of the one described by Oshima et al. (Nucleic Acid 
Res. 18 (1990), 181). 

The transgenic plants of the invention may, in principle, be plants of any plant 
species. They may be both monocotyledonous and dicotyledonous plants. 
Preferably, the plants are useful plants, i.e. commercially important plants, cultivated 
by man for nutrition or for technical, in particular industrial, purposes. They may be 
sugar storing and/or starch-storing plants, for instance cereal species (rye, barley, 
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oat, wheat, maize, millet, sago etc.), rice, pea, marrow pea, cassava, sugar cane, 
sugar beet and potato; tomato, rape, soybean, hemp, flax, sunflower, cow pea or 
arrowroot, fiber-forming plants (e.g. flax, hemp, cotton), oil-storing plants (e.g. rape, 
sunflower, soybean) and protein-storing plants (e.g. legumes, cereals, soybeans). 
Preferably, the invention refers to Brassicaceae species such as cabbage varieties or 
rape seed. The plants within the scope of the invention also include fruit trees, palms 
and other trees or wooden plants being of economical value such as in forestry. 
Moreover, the method of the invention relates to forage plants (e.g. forage and 
pasture grasses, such as alfalfa, clover, ryegrass) and vegetable plants (e.g. tomato, 
lettuce, chicory) and ornamental plants (e.g. roses, tulips, hyacinths). 

According to the provisions of the invention, transgenic plants can be prepared by 
introducing a polynucleotide into plant cells and regenerating the transformed cells to 
plants by methods well known to the person skilled in the art. 

Methods for the introduction of foreign genes into plants are also well known in the art. 
These include, for example, the transformation of plant cells or tissues with T-DNA 
using Agrobacterium tumefaciens or Agrobacterium rhizogenes, the fusion of 
protoplasts, direct gene transfer (see, e.g., EP-A 164 575), injection, electroporation, 
vacuum infiltration, biolistic methods like particle bombardment, pollen-mediated 
transformation, plant RNA virus-mediated transformation, liposome-mediated 
transformation, transformation using wounded or enzyme-degraded immature 
embryos, or wounded or enzyme-degraded embryogenic callus and other methods 
known in the art. The vectors used in the method of the invention may contain further 
functional elements, for example "left border"- and "right border"-sequences of the T- 
DNA of Agrobacterium which allow stable integration into the plant genome. 
Furthermore, methods and vectors are known to the person skilled in the art which 
permit the generation of marker free transgenic plants, i.e. the selectable or scorable 
marker gene is lost at a certain stage of plant development or plant breeding. This 
can be achieved by, for example co-transformation (Lyznik, Plant Mol. Biol. 13 
(1989), 151-161; Peng, Plant Mol. Biol. 27 (1995), 91-104) and/or by using systems 
which utilize enzymes capable of promoting homologous recombination in plants 
(see, e.g., WO97/08331; Bayley, Plant Mol. Biol. 18 (1992), 353-361); Lloyd, Mol. 
Gen. Genet. 242 (1994), 653-657; Maeser, Mol. Gen. Genet. 230 (1991), 170-176; 
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Onouchi, Nucl. Acids Res. 19 (1991), 6373-6378). Methods for the preparation of 
appropriate vectors are described by, e.g., Sambrook and Russell (2001), Molecular 
Cloning: A Laboratory Manual, CSH Press, Cold Spring Harbor, NY, USA. 
Suitable strains of Agrobacterium tumefaciens and vectors as well as transformation 
of Agrobacteria and appropriate growth and selection media are well known to those 
skilled in the art and are described in the prior art (GV3101 (pMK90RK), Koncz, Mol. 
Gen. Genet. 204 (1986), 383-396; C58C1 (pGV 3850kan), Deblaere, Nucl. Acid Res. 
13 (1985), 4777; Bevan, Nucleic. Acid Res. 12(1984), 8711; Koncz, Proc. Natl. Acad. 
Sci. USA 86 (1989), 8467-8471; Koncz, Plant Mol. Biol. 20 (1992), 963-976; Koncz, 
Specialized vectors for gene tagging and expression studies. In: Plant Molecular 
Biology Manual Vol 2, Gelvin and Schilperoort (Eds.), Dordrecht, The Netherlands: 
Kluwer Academic Publ. (1994), 1-22; EP-A-120 516; Hoekema: The Binary Plant 
Vector System, Offsetdrukkerij Kanters B.V., Alblasserdam (1985), Chapter V, 
Fraley, Crit. Rev. Plant. Sci., 4, 1-46; An, EMBO J. 4 (1985), 277-287). Although the 
use of Agrobacterium tumefaciens is preferred in the method of the invention, other 
Agrobacterium strains, such as Agrobacterium rhizogenes, may be used, for example if 
a phenotype conferred by said strain is desired. 

Methods for the transformation using biolistic methods are well known to the person 
skilled in the art; see, e.g., Wan, Plant Physiol. 104 (1994), 37-48; Vasil, 
Bio/Technology 11 (1993), 1553-1558 and Christou (1996) Trends in Plant Science 1, 
423-431. Microinjection can be performed as described in Potrykus and Spangenberg 
(eds.), Gene Transfer To Plants. Springer Verlag, Berlin, NY (1995). 
The transformation of most dicotyledonous plants is possible with the methods 
described above. But also for the transformation of monocotyledonous plants several 
successful transformation techniques have been developed. These include the 
transformation using biolistic methods as, e.g., described above as well as protoplast 
transformation, electroporation of partially permeabilized cells, introduction of DNA 
using glass fibers, etc. Also, the transformation of monocotyledonous plants by means 
of Agrobacterium-based vectors has been described (Chan et al., Plant Mol. Biol. 22 
(1993), 491-506; Hiei et al., Plant J. 6 (1994) 271-282; Deng et al, Science in China 
33 (1990), 28-34; Wilmink et al, Plant Cell Reports 11 (1992), 76-80; May et al., 
Bio/Technology 13 (1995), 486-492; Conner and Dormisse, Int. J. Plant Sci. 153 
(1992), 550-555; Ritchie et al. Transgenic Res. 2 (1993), 252-265). An alternative 
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system for transforming monocotyledonous plants is the transformation by the 
biolistic approach (Wan and Lemaux, Plant Physiol. 104 (1994), 37-48; Vasil et al., 
Bio/Technology 11 (1993), 1553-1558; Ritala et al., Plant Mol. Biol. 24 (1994) 317- 
325; Spencer et al., Theor. Appl. Genet. 79 (1990), 625-631). The transformation of 
maize in particular has been repeatedly described in the literature (see for instance 
WO 95/06128, EP 0 513 849, EP 0 465 875, EP 29 24 35; Fromm et al, 
Biotechnology 8, (1990), 833-844; Gordon-Kamm et al., Plant Cell 2, (1990), 603- 
618; Koziel et al., Biotechnology 11 (1993), 194-200; Moroc et al., Theor. Appl. 
Genet. 80, (1990), 721-726). The successful transformation of other types of cereals 
has also been described for instance of barley (Wan and Lemaux, supra; Ritala et al., 
supra, Krens et al., Nature 296 (1982), 72-74), wheat (Nehra et al., Plant J. 5 (1994), 
285-297) and rice. 

The resulting transformed plant cell can then be used to regenerate a transformed 
plant in a manner known by a skilled person. 

In addition, the present invention relates to transgenic plants which show an 
increased activity of the polypeptide encoded by the polynucleotide the invention 
compared to a corresponding wild-type plant. 

The term "increased activity" refers to a significant increase of the beta-glucosidase 
activity of the polypeptide of the invention in the transgenic plant compared to a 
corresponding wild-type plant. Preferably, said activity is increased in the transgenic 
plant by at least 10%, preferably by at least 20%, more preferably by at least 50%, 
and even more preferred by at least 100% as compared to the corresponding wild- 
type plant. Beta-glucosidase activity may be determined in enzyme assays using a 
preparation from a plant sample. In particular, this assay is specific enough to 
exclude any other beta-glucosidase activity present in the plant. Advantageously, a 
substrate compound is used for such assays for which the polypeptide of the 
invention is specific and which can be detected by suitable methods known in the art. 
However, an increase of the activity of the polypeptide of the invention may also be 
inferred from a significant increase of the amount of corresponding transcript and/or 
protein present in the transgenic plant. Preferentially, transgenic plants having an 
increased activity of the polypeptide of the invention may be characterized by an 
increase of the amount of transcript corresponding to the polynucleotide of the 
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invention by at least 20%, preferably at least 50% and more preferably at least 100% 
as compared to the corresponding wild-type plant. Likewise, it is preferred that 
transgenic plants having an increased activity of the polypeptide of the invention may 
be characterized by an increase of the protein amount of the polypeptide of the 
invention by at least 20%, preferably at least 50% and more preferably at least 100% 
as compared to the corresponding wild-type plant. 

Corresponding increases of the activity of the polypeptide of the invention may for 
instance be achieved by expressing said polynucleotide in cells of a transgenic plant 
from a heterologous construct for example as described above. However, the state of 
the art provides further methods for achieving a corresponding increased activity. For 
example, the endogenous gene encoding the beta-glucosidase of the invention may 
be modified accordingly at its natural location, e.g. by homologous recombination. In 
particular, the promoter of this gene can for instance be altered in a way that 
promoter activity is enhanced. In the alternative, the coding region of the gene can be 
modified so that the encoded polypeptide shows an increased activity, e.g. by 
specifically substituting amino acid residues in the catalytically active domain of the 
polypeptide. Applicable homologous recombination techniques (also known as "in 
vivo mutagenesis") are known to the person skilled in the art and are described in the 
literature. One such technique involves the use of a hybrid RNA-DNA oligonucleotide 
("chimeroplast") which is introduced into cells by transformation (TIBTECH 15 (1997), 
441^47; W095/1 5972; Kren, Hepatology 25 (1997), 1462-1468; Cole-Strauss, Science 
273 (1996), 1386-1389). Thereby, part of the DNA component of the RNA-DNA 
oligonucleotide is homologous with the target gene sequence, however, displays in 
comparison to this sequence a mutation or a heterologous region which is 
surrounded by the homologous regions. The term "heterologous region" refers to any 
sequence that can be introduced and which is different from that to be modified. By 
means of base pairing of the homologous regions with the target sequence followed 
by a homologous recombination, the mutation or the heterologous region contained 
in the DNA component of the RNA-DNA oligonucleotide can be transferred to the 
corresponding gene. By means of in vivo mutagenesis, any part of the gene 
encoding the beta-glucosidase of the invention can be modified as long as it results 
in an increase of the activity of this protein. 



30 



In a preferred embodiment, the above-described transgenic plants show, upon an 
increased activity of the protein encoded by the polynucleotide of the invention, an 
increased resistance against a plant pathogen to which a corresponding wild-type 
plant is susceptible. 

The term "increased resistance" may refer both to an enhancement of a resistance 
already present in the wild-type plant and to the establishment of a resistance that is 
not present in the wild-type plant. 

Preferably, these transgenic plants contain a polynucleotide as defined above, i.e. a 
polynucleotide or a recombinant nucleic acid molecule that is introduced in a plant 
cell and the presence of which in the genome of said plant preferably leads to an 
increased activity of the beta-glucosidase of the invention, stably integrated into the 
genome. 

In a further aspect, the present invention relates to transgenic plants which, upon the 
presence of a suitable nucleic acid molecule in the genome of its cells, show a 
reduced activity of the beta-glucosidase of the invention. 

Such transgenic plants may be useful objects for studying the mechanism of non- 
host resistance and the role the polypeptide of the invention plays in that mechanism. 
The above explanations concerning techniques for producing transgenic plants and 
plant cells as well as suitable transformation techniques and vectors mentioned in 
connection with the transgenic plants having an increased activity of the beta- 
glucosidase of the invention may be likewise applied in the present embodiment. 
Methods for specifically reducing the activity of a protein in plant cells by the 
introduction of nucleic acid molecules are exhaustively and widely described in the 
literature and are known to the person skilled in the art. These include but are not 
limited to antisense, ribozyme, co-suppression, RNA interference, expression of 
dominant negative mutants, antibody expression and in vitro mutagenesis 
approaches. 

The term "presence of a suitable nucleic acid molecule" as used herein refers to any 
foreign nucleic acid molecule that is present in cells of a transgenic plant produced in 
accordance with the invention but absent from the cells of the corresponding source 
plant. Thereby encompassed are nucleic acid molecules, e.g. gene sequences, 
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which differ from the corresponding nucleic acid molecule in the source plant cell by 
at least one mutation (substitution, insertion, deletion, etc. of at least one nucleotide), 
wherein such a mutation inhibits the expression of the affected gene or reduces the 
activity of the gene product. Furthermore encompassed by the term "foreign" are 
nucleic acid molecules which are homologous with respect to the source plant cell 
but are situated in a different chromosomal location or differ, e.g., by way of a 
reversed orientation for instance with respect to the promoter. 

In principle, the nucleic acid molecule to be introduced in accordance with the 
present embodiment may be of any conceivable origin, e.g. eukaryotic or prokaryotic. 
It may be from any organism which comprises such molecules. Furthermore, it may 
be synthetic or derived from naturally occurring molecules by, e.g., modification of its 
sequence, i.e. it may be a variant or derivative of a naturally occurring molecule. 
Such variants and derivatives include but are not limited to molecules derived from 
naturally occurring molecules by addition, deletion, mutation of one or more 
nucleotides or by recombination. It is, e.g., possible to change the sequence of a 
naturally occurring molecule so as to match the preferred codon usage of plants, in 
particular of those plants in which the nucleic acid molecule shall be expressed. 

It is furthermore preferred that the nucleic acid molecule introduced into a plant cell in 
accordance with the present embodiment has to be expressed in the transgenic plant 
in order to exert the reducing effect upon beta-glucosidase activity. The term 
"expressed" means for such a nucleic acid molecule that it is at least transcribed, and 
for some embodiments also translated into a protein, in at least some of the cells of 
the plant. Preferred examples of such nucleic acid molecules relate to those 
embodiments of the transgenic plants of the invention wherein said reduced beta- 
glucosidase activity is achieved by an antisense, co-suppression, ribozyme or RNA 
interference effect or by the expression of antibodies or other suitable (poly)peptides 
capable of specifically reducing said acitvity or by the expression of a dominant- 
negative mutant. These methods are further explained in the following. 

Accordingly, the use of nucleic acid molecules encoding an antisense RNA which is 
complementary to transcripts of a gene encoding a plant beta-glucosidase of the 
invention is a preferred embodiment of the present invention. Thereby, complementarity 
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does not signify that the encoded RNA has to be 100% complementary. A low degree 
of complementarity may be sufficient as long as it is high enough to inhibit the 
expression of such a beta-glucosidase protein upon expression of said RNA in plant 
cells. The transcribed RNA is preferably at least 90% and most preferably at least 95% 
complementary to the transcript of the nucleic acid molecule encoding PEN2. In order 
to cause an antisense effect during the transcription in plant cells such RNA molecules 
have a length of at least 15 bp, preferably a length of more than 100 bp and most 
preferably a length or more than 500 bp, however, usually less than 2000 bp, preferably 
shorter than 1500 bp. Exemplary methods for achieving an antisense effect in plants 
are for instance described by Muller-Rober (EMBO J. 11 (1992), 1229-1238), 
Landschutze (EMBO J. 14 (1995), 660-666), D'Aoust (Plant Cell 11 (1999), 2407-2418) 
and Keller (Plant J. 19 (1999), 131-141) and are herewith incorporated in the 
description of the present invention. Likewise, an antisense effect may also be achieved 
by applying a triple-helix approach, whereby a nucleic acid molecule complementary to 
a region of the gene, encoding the relevant beta-glucosidase, designed according to 
the principles for instance laid down in Lee (Nucl. Acids Res. 6 (1979), 3073); Cooney 
(Science 241 (1998), 456) or Dervan (Science 251 (1991), 1360) may inhibit its 
transcription. 

A similar effect as with antisense techniques can be achieved by producing transgenic 
plants expressing suitable constructs in order to mediate an RNA interference (RNAi) 
effect. Thereby, the formation of double-stranded RNA leads to an inhibition of gene 
expression in a sequence-specific fashion. More specifically, in RNAi constructs, a 
sense portion comprising the coding region of the gene to be inactivated (or a part 
thereof, with or without non-translated region) is followed by a corresponding antisense 
sequence portion. Between both portions, an intron not necessarily originating from the 
same gene may be inserted. After transcription, RNAi constructs form typical hairpin 
structures. In accordance with the teachings of the present invention, the RNAi 
technique may be carried out as described by Smith (Nature 407 (2000), 319-320) or 
Marx (Science 288 (2000), 1 370-1 372). 

Also DNA molecules can be employed which, during expression in plant cells, lead to 
the synthesis of an RNA which reduces the expression of the gene encoding the beta- 
glucosidase of the invention in the plant cells due to a co-suppression effect. The 
principle of co-suppression as well as the production of corresponding DNA sequences 



33 



is precisely described, for example, in WO 90/12084. Such DNA molecules preferably 
encode an RNA having a high degree of homology to transcripts of the target gene. It 
is, however, not absolutely necessary that the coding RNA is translatable into a protein. 
The principle of the co-suppression effect is known to the person skilled in the art and 
is, for example, described in Jorgensen, Trends Biotechnol. 8 (1990), 340-344; Niebel, 
Curr. Top. Microbiol. Immunol. 197 (1995), 91-103; Flavell, Cunr. Top. Microbiol. 
Immunol. 197 (1995), 43-36; Palaqui and Vaucheret, Plant. Mol. Biol. 29 (1995), 149- 
159; Vaucheret, Mol. Gen. Genet. 248 (1995), 311-317; de Borne, Mol. Gen. Genet. 
243 (1994), 613-621 and in other sources. 

Likewise, DNA molecules encoding an RNA molecule with ribozyme activity which 
specifically cleaves transcripts of a gene encoding the relevant beta-glucosidase can 
be used. Ribozymes are catalytically active RNA molecules capable of cleaving RNA 
molecules and specific target sequences. By means of recombinant DNA techniques, 
it is possible to alter the specificity of ribozymes. There are various classes of 
ribozymes. For practical applications aiming at the specific cleavage of the transcript 
of a certain gene, use is preferably made of representatives of the group of 
ribozymes belonging to the group I intron ribozyme type or of those ribozymes 
exhibiting the so-called "hammerhead" motif as a characteristic feature. The specific 
recognition of the target RNA molecule may be modified by altering the sequences 
flanking this motif. By base pairing with sequences in the target molecule, these 
sequences determine the position at which the catalytic reaction and therefore the 
cleavage of the target molecule takes place. Since the sequence requirements for an 
efficient cleavage are low, it is in principle possible to develop specific ribozymes for 
practically each desired RNA molecule. In order to produce DNA molecules encoding 
a ribozyme which specifically cleaves transcripts of a gene encoding the relevant beta- 
glucosidase, for example a DNA sequence encoding a catalytic domain of a ribozyme 
is bilaterally linked with DNA sequences which are complementary to sequences 
encoding the target protein. Sequences encoding the catalytic domain may for example 
be the catalytic domain of the satellite DNA of the SCMo virus (Davies, Virology 177 
(1990), 216-224 and Steinecke, EMBO J. 11 (1992), 1525-1530) or that of the satellite 
DNA of the TobR virus (Haseloff and Gerlach, Nature 334 (1988), 585-591). The 
expression of ribozymes in order to decrease the activity of certain proteins in cells is 
known to the person skilled in the art and is, for example, described in EP-B1 0 321 
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201. The expression of ribozymes in plant cells is for example described in Feyter 
(Mol. Gen. Genet. 250 (1996), 329-338). 

Furthermore, nucleic acid molecules encoding antibodies specifically recognizing the 
relevant beta-glucosidase in a plant, i.e. specific fragments or epitopes of such a 
protein, can be used for inhibiting the activity of this protein. These antibodies can be 
monoclonal antibodies, polyclonal antibodies or synthetic antibodies as well as 
fragments of antibodies, such as Fab, Fv or scFv fragments etc. Monoclonal antibodies 
can be prepared, for example, by the techniques as originally described in Kohler and 
Milstein (Nature 256 (1975), 495) and Galfr6 (Meth. Enzymol. 73 (1981) 3), which 
comprise the fusion of mouse myeloma cells to spleen cells derived from immunized 
mammals. Furthermore, antibodies or fragments thereof to the aforementioned 
peptides can be obtained by using methods which are described, e.g., in Harlow and 
Lane "Antibodies, A Laboratory Manual", CSH Press, Cold Spring Harbor, 1988. 
Expression of antibodies or antibody-like molecules in plants can be achieved by 
methods well known in the art, for example, full-size antibodies (During, Plant. Mol. Biol. 
15 (1990), 281-293; Hiatt, Nature 342 (1989), 469-470; Voss, Mol. Breeding 1 (1995), 
39-50), Fab-fragments (De Neve, Transgenic Res. 2 (1993), 227-237), scFvs (Owen, 
Bio/Technology 10 (1992), 790-794; Zimmermann, Mol. Breeding 4 (1998), 369-379; 
Tavladoraki, Nature 366 (1993), 469-472; Artsaenko, Plant J. 8 (1995), 745-750) and 
variable heavy chain domains (Benvenuto, Plant Mol. Biol. 17 (1991), 865-874) have 
been successfully expressed in tobacco, potato (Schouten, FEBS Lett. 415 (1997), 235- 
241) or Arabidopsis, reaching expression levels as high as 6.8% of the total protein 
(Fiedler, Immunotechnology 3 (1997), 205-216). 

Moreover, also nucleic acid molecules encoding peptides or polypeptides capable of 
reducing the activity of the relevant ABC transporter other than antibodies can be used 
in the present context. Examples of suitable peptides or polypeptides that can be 
constructed in order to achieve the intended purpose can be taken from the prior art 
and include, for instance, binding proteins such as lectins. 

In addition, nucleic acid molecules encoding a mutant form of the relevant beta- 
glucosidase can be used to interfere with the activity of the wild-type protein. Such a 
mutant form preferably has lost its biological activity, e.g. its hydrolytic activity on 
glycosidic bonds, and may be derived from the corresponding wild-type protein by way 
of amino acid deletion(s), substitution(s), and/or additions in the amino acid sequence of 
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the protein. Mutant forms of such proteins may show, in addition to the loss of the 
hydrolytic activity, an increased substrate affinity and/or an elevated stability in the 
cell, for instance, due to the incorporation of amino acids that stabilize proteins in the 
cellular environment. These mutant forms may be naturally occurring or, as preferred, 
genetically engineered mutants. 

In another preferred embodiment, the nucleic acid molecule, the presence of which in 
the genome of a plant cell leads to a reduction of the activity of the beta-glucosidase 
of the invention, does not require its expression to exert its reducing effect on beta- 
glucosidase activity. Correspondingly, preferred examples relate to methods wherein 
said reduced beta-glucosidase activity is achieved by in vivo mutagenesis or by the 
insertion of a heterologous DNA sequence in the gene encoding the beta- 
glucosidase of the invention. 

The term "in vivo mutagenesis", relates to methods where the sequence of the gene 
encoding the relevant beta-glucosidase is modified at its natural chromosomal 
location such as for instance by techniques applying homologous recombination. 
This may be achieved by using a hybrid RNA-DNA oligonucleotide ("chimeroplast") 
which is introduced into cells by transformation (TIBTECH 15 (1997), 441-447; 
W095/15972; Kren, Hepatology 25 (1997), 1462-1468; Cole-Strauss, Science 273 
(1996), 1386-1389). Part of the DNA component of the RNA-DNA oligonucleotide is 
homologous to the target ABC transporter gene sequence, however, displays in 
comparison to this sequence a mutation or a heterologous region which is 
surrounded by the homologous regions. The term "heterologuous region" 
corresponds to any sequence that can be introduced and encompasses, for instance, 
also sequences from the same beta-glucosidase gene but from a different site than 
that which is to be mutagenized. By means of base pairing of the homologous 
regions with the target sequence followed by a homologous recombination, the 
mutation or the heterologous region contained in the DNA component of the RNA- 
DNA oligonucleotide can be transferred to the corresponding gene of the plant cell. 
By means of in vivo mutagenesis, any part of the gene encoding the beta- 
glucosidase can be modified as long as it results in a decrease of the beta- 
glucosidase activity. Thus, in vivo mutagenesis can for instance concern, the 
promoter, e.g. the RNA polymerase binding site, as well as the coding region, in 
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particular those parts encoding the substrate binding site or the catalytically active 
site or a signal sequence directing the protein to the appropriate cellular 
compartment. 

The term "heterologous DNA sequence" refers to any DNA sequences which can be 
inserted into the target gene via appropriate techniques other than those described 
above in connection with in vivo mutagenesis. The insertion of such a heterologous 
DNA sequence may be accompanied by other mutations in the target gene such as 
the deletion, inversion or rearrangement of the sequence located at the insertion site. 
This embodiment of the invention includes that the introduction of a nucleic acid 
molecule leads to the generation of a pool, i.e. a plurality, of transgenic plants in the 
genome of which the nucleic acid molecule, i.e. the heterologous DNA sequence, is 
randomly spread over various chromosomal locations, and that this generation of 
transgenic plants is followed by selecting those transgenic plants out of the pool 
which show the desired genotype, i.e. an inactivating insertion in the relevant beta- 
glucosidase gene and/or the desired phenotype, i.e. a reduced beta-glucosidase 
activity and/or reduced non-host resistance as for example described for the pen2 
mutant plants (see Examples 2 and 3). 

Suitable heterologous DNA sequences that can be taken for such an approach are 
described in the literature and include, for instance vector sequences capable of self- 
integration into the host genome or mobile genetic elements. Particularly preferred in 
this regard are T-DNA or transposons which are well-known to the person skilled in 
the art from so-called tagging experiments used for randomly knocking out genes in 
plants. The production of such pools of transgenic plants can for example be carried 
out as described in Jeon (Plant J. 22 (2000), 561-570) or Parinov (Curr. Op. 
Biotechnol. 11 (2000), 157-161). 

Another example of insertional mutations that may result in gene silencing includes 
the duplication of promoter sequences which may lead to a methylation and thereby 
an inactivation of the promoter (Morel, Current Biology 10 (2000), 1591-1594). 
Furthermore, it is immediately evident to the person skilled in the art that the above- 
described approaches, such as antisense, ribozyme, co-suppression, in-vivo 
mutagenesis, RNAi, expression of antibodies, other suitable peptides or polypeptides 
or dominant-negative mutants and the insertion of heterologous DNA sequences, can 
also be used for the reduction of the expression of genes that encode a regulatory 
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protein such as a transcription factor, that controls the expression of the relevant 
beta-glucosidase or, e.g., proteins that are necessary for the beta-glucosidase to 
become active. 

It is also evident from the disclosure of the present invention that any combination of 
the above-identified approaches can be used for the generation of transgenic plants, 
which due to the one or more of the above-described nucleic acid molecules in their 
cells, display a reduced activity of the relevant beta-glucosidase compared to 
corresponding source plants. Such combinations can be made, e.g., by (co-) 
transformation of corresponding nucleic acid molecules into the plant cell, plant 
tissue or plant or by crossing transgenic or mutant plants that have been generated 
according to different techniques. Likewise, the transgenic plants of the present 
invention showing a reduced activity of the beta-glucosidase of the invention can be 
crossed with plants, e.g. transgenic plants, having other desired traits. 

The invention also relates to propagation material of the transgenic plants of the 
invention comprising plant cells according to the invention. The term "propagation 
material" comprises those components or parts of the plant which are suitable to 
produce offspring vegetatively or generatively. Suitable means for vegetative 
propagation are for instance cuttings, callus cultures, rhizomes or tubers. Other 
propagation material includes for instance fruits, seeds, seedlings, protoplasts, cell 
cultures etc. The preferred propagation materials are tubers and seeds. 
The invention also relates to harvestable parts of the plants of the invention such as, 
for instance, fruits, seeds, tubers, rootstocks, leaves or flowers. 

Corresponding to the above explanations, the invention furthermore relates to a 
method for conferring pathogen resistance or increased pathogen resistance to a 
plant comprising the step of providing a transgenic plant in which the activity of the 
polypeptide encoded by the above-described polynucleotide of the invention is 
increased. 

In a further aspect, the present invention relates to methods for identifying a 
compound that is hydrolyzed by the polypeptide of the invention. Some of the 
methods may comprise the step of contacting a candidate compound with an effector 
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molecule (such as said polypeptide). This step may be accomplished by adding a 
sample containing said candidate compound or a plurality of candidate compounds to 
the assay mixture. If such a sample or plurality of compounds is identified in one of 
the methods to contain a compound of interest, then it is either possible to isolate the 
compound from the original sample or one can further subdivide the original sample, 
for example, if it consists of a plurality of different compounds, so as to reduce the 
number of different substances per sample and repeat the method with the 
subdivisions of the original sample. Depending on the complexity of the samples, the 
steps described herein can be performed several times, preferably until the sample 
identified according to the method of the invention only comprises a limited number 
of or only one substance(s). Preferably said sample comprises substances of similar 
chemical and/or physical properties, and most preferably said substances are 
identical. 

Furthermore, a compound identified by said method(s) may be known in the art but 
hitherto not known to be a substrate for the polypeptide of the invention, preferably it 
was not known that said compound or a hydrolysis product thereof is capable of 
increasing resistance against pathogens in plants. 

Accordingly, in one aspect, the present invention relates to a method for identifying a 
compound that is hydrolyzed by a polypeptide encoded by the above-described 
polynucleotide of the invention comprising the steps of 

(a) contacting a candidate compound with said polypeptide under conditions 
where said polypeptide is active; and 

(b) determining whether said candidate compound is hydrolyzed by said 
polypeptide. 

The polypeptide to be used in this method can be provided as described above, for 
example from a natural source from which the polypeptide preferably is purified by 
applying suitable techniques known in the art, by chemical synthesis or by 
recombinant expression. Preferentially, the polypeptide is expressed in a prokaryotic 
host, preferably a bacterial cell such as an Escherichia coli cell. The method for 
identifying a substrate compound may be performed using the cell culture medium or 
the supernatant thereof into which the polypeptide has been secreted. Also, the 
polypeptide may be purified to a degree which is necessary by conventional 
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techniques in order to obtain significant results in said method. In a preferred 
embodiment, the polypeptide may be used in an immobilized form, i.e. directly or 
indirectly via the linkage over suitable intermediate molecules (e.g. peptide linkers, 
antibodies and the like) attached to a solid support. In a further preferred 
embodiment, the method for identifying a substrate compound may be set up in a 
high-throughput fashion. Suitable strategies in order to achieve high-throughput scale 
for enzyme assays are known to a skilled person and are described in the literature. 
In the prior art, several approaches are described for overexpression of a polypeptide 
and activity determination that can be adapted for use in the present method of the 
invention such as the approaches described by Minami (Plant Cell Physiol. 41 
(2000), 218-225), Chen (Protein Expression and Purification 17 (1999), 414-421) and 
Dharmawardhana (Plant Mol. Biol. 40 (1999), 365-372). 

In addition, the present invention relates to a method for identifying a compound that 
is hydrolyzed by a polypeptide encoded by the above-described polynucleotide of the 
invention comprising the steps of 

(a) providing a 3-dimensional structure model of said polypeptide; and 

(b) determining the structure of a substrate that fits into the 3-dimensional 
structure model of (a). 

Three-dimensional structure models of the polypeptide of the invention may be 
provided according to appropriate techniques known to a person skilled in the art and 
described in the literature. Conventionally, three-dimensional data on protein 
structure can be obtained from x-ray crystallography. For the purposes of the present 
invention, crystallization, structure determination and data processing and 
representation in a three-dimensional model may be carried out as described in 
Czjzek (Proc. Natl. Acad. Sci. USA 97 (2000), 13555-13560). In this publication, 
results are described on x-ray crystallography of an inactive mutant of the maize 
beta-glucosidase Glu1 and the structural elements that determine substrate 
specificity of the natural substrate DIMBOAGIc as well as analogs of this compound. 
Therein, it was confirmed that, for beta-glucosidases, specificity is determined by the 
aglycone portion of the substrate. Other instructive examples for corresponding 
approaches are described in Czjzek (Biochem. J. 354 (2001), 37-46) and Burmeister 
(Structure 5 (1997), 663-675). 
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For obtaining information on the structure of the substrate via three-dimensional 
structure data, the wild-type polypeptide, mutants thereof such as catalytically 
inactive variants or fragments bearing the substrate-binding portion of the 
polypeptide can be used for crystallization. 

In yet another embodiment, the invention relates to a method for identifying a 
compound that is hydrolyzed by a polypeptide encoded by the above-described 
polynucleotide of the invention comprising the steps of 

(a) providing a mutant protein of said polypeptide the catalytic activity of which is 
abolished without loosing substrate binding activity; 

(b) contacting a candidate compound with said mutant protein; and 

(c) determining whether the candidate compound is bound by said mutant protein. 
This method is based on the assumption that the substrate compound to be identified 
is characterized by its ability to be bound by the polypeptide of the invention. Thus, 
by identifying compounds that are bound by said polypeptide, one may restrict the 
scope of potential substrate molecules to a high degree, if not to the actual substrate 
molecule itself. The technique applied in this method is also known as "substrate 
trap". For instance, Flint (Proc. Natl. Acad. Sci. USA 95 (1997), 1680-1685) applied 
substrate-trapping in order to identify physiological substrates of protein tyrosine 
phosphatases. 

The provision of mutant proteins the catalytic activity of which is abolished without 
loosing substrate binding activity can be accomplished based on information on the 
amino acid residues that are known to be necessary for catalytic activity in family 1 beta- 
glucosidases. Such information is available from the prior art. For instance, the glutamic 
acid residues (E) in the conserved family 1 beta-glucosidase motifs TFNEP and 
(IA/)TENG (SEQ ID NOs:3 and 4) are known to be involved in the catalytic reactions and 
are therefore preferred targets for producing catalytically inactive mutants of the 
polypeptide of the invention. Czjzek (Proc. Natl. Acad. Sci. USA 97 (2000), 13555-13560) 
produced such a mutant by substituting the glutamic acid residue at position 191 by an 
aspartate residue (E191D). Accordingly, such a mutant may be produced starting from 
the polypeptide of the invention by substituting the amino acid residue corresponding to 
the glutamate residue at position 183 of SEQ ID NO: 2 by, e.g. an aspartate residue. 
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Inactivation of the polypeptide of the invention may be carried out according to suitable 
techniques known in the prior art and comprise recombinant methods as well as post- 
translational, for example chemical modifications. Recombinant methods include the 
introduction of mutations such as substitutions, additions, deletions, inversions or 
transversions into the amino acid sequence of the polypeptide as they are for example 
described in Sambrook and Russell (2001), Molecular Cloning: A Laboratory Manual, 
CSH Press, Cold Spring Harbor, NY, USA. These may for instance be accomplished by 
in vitro mutagenesis, e.g. based on PCR, or in vivo mutagenesis. In a preferred 
embodiment of the present method, the polypeptide may be used in an immobilized form, 
i.e. directly or indirectly via the linkage over suitable intermediate molecules (e.g. peptide 
linkers, antibodies and the like) attached to a solid support. 

The candidate compounds to be tested in the present method can in principle be 
taken from any source that can be expected to contain potential beta-glucosidase 
substrates. Plant beta-glucosidases are for example known to be involved in the 
defence against pests, in lignification and in cell wall catabolism. Thus, preferably 
candidate compounds may be taken from corresponding sources, i.e. plant 
pathogens (e.g. elicitors), lignin precursors or cell wall components or precursors 
thereof. As it has been shown in Example 4, the pen2 mutant plants have an altered 
cell wall composition as compared to corresponding wild-type plants. Thus, it is 
particularly preferred to take the candidate compounds for use in the present method 
from the cell wall or cell wall precursor substances, preferably from the plant species 
from which the polypeptide used in the method is derived. Practically, compound 
libraries that are routinely used for screenings may be of use herein. 
Contacting a candidate compound with the mutant protein should be carried out under 
conditions which allow binding of the substrate compound by the mutant protein. 
Preferably, the mutant protein is contacted with a plurality of candidate compounds. After 
the contacting step, the unbound candidate compounds can be removed from the assay 
mixture and subsequently the compound bound by the mutant protein, if any, can be 
examined in order to determine its identity. For this purpose, advantageously, the bound 
compound is released from the mutant protein prior to determination by taking 
appropriate measures. Determination of the compound's identity can be done using 
according techniques known in the prior art such as thin layer chromatography (TLC), 
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capillary electrophoresis (CE), gas chromatography (GC), mass spectrometry (MS), liquid 
chromatography (LC), NMR or FT-IR or combinations thereof such as GC/MS or LC/MS. 
Likewise, whether the candidate compound is bound by the mutant protein may be 
determined by removing any unbound or non-specifically bound candidate compound 
contacted with the mutant protein from the assay mixture, followed by changing the assay 
conditions such that any specifically bound compound is released from the mutant 
protein. Thereby, it is possible to determine whether a compound is released and thereby 
determining whether the candidate compound is bound by the mutant protein and, 
optionally and in case there has been bound a compound, its identity may subsequently 
be identified using suitable techniques. 

Moreover, it is evident to a person skilled in the art that the methods for identifying a 
compound that is hydrolyzed by a polypeptide of the invention as they are explained 
above can be combined. Thus, it is for instance possible in a first stage to screen a 
large set of compounds, possibly also pools of compounds, in substrate trap 
experiments or by obtaining insights into the molecular structure of potential 
substrate compounds through the use of 3-dimensional structure data. Then, in a 
second stage of screening, pre-selected candidate compounds can be subjected to 
hydrolysis assays using catalytically active polypeptide preparations. 

In a preferred embodiment, the above-outlined methods for identifying a compound 
that is hydrolyzed by the polypeptide of the invention furthermore comprise the step 
of determining whether the identified compound or, as it is preferred, a hydrolysis 
product thereof is capable of inducing or enhancing a defence response against a 
pathogen in a plant. 

In this additional step within the process of identifying a substrate compound to the 
polypeptide of the invention, the compound that is identified by any one of the above- 
outlined methods or combinations thereof is further tested for the ability to activate 
defence against a pathogen in a plant. It is to be noted that this requires that the 
plant used expresses the polypeptide of the invention that is specific for the substrate 
compound used since it is contemplated that the aglucone product of said compound 
is the form active in plant defence. Therefore, in a preferred embodiment, plants are 
used in the present method that express the specific polypeptide of the invention, 
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either naturally or, which is especially preferred, because they are transgenic plants 
transformed with a construct expressing said polypeptide. 

In any case or if the respective plant does not express this polypeptide, one or more 
hydrolysis products of the compound may be used in order to test for defence- 
activating activity. The term "hydrolysis product" as used throughout the present 
application refers to the products that result from a hydrolysis reaction acting on a 
substrate molecule catalyzed by the polypeptide of the invention. For example, the 
hydrolysis product may be the aglycone portion of a substrate which is a conjugate 
compound. Aglycone portions of beta-glucosidase substrates are known to be 
diverse with regard to their chemical nature and their biological function. Such 
aglycones may for instance comprise plant hormones, flavonoids or cyanogenic 
compounds. The aglycon product of the hydrolyzed substrates can serve a multitude 
of functions including growth and development (Selmar et al., 1987; Brzobohaty et 
al., 1993; Dietz et al., 2000), cell wall catabolism (Leah et al., 1995; Gerardi et al., 
2001), lignification (Dharmawardhana et al., 1995 and 1999), and defense (Zheng 
and Poulton, 1995; Rsak et al., 2000). However, if for example the substrate of the 
polypeptide of the invention is an oligo- or polysaccharide, both hydrolysis products 
may be carbohydrates or at least compounds comprising glycosidic portions. For 
instance Gerardi (Plant Science 160 (2001), 795-805) describe beta-glucosidases 
that can efficiently cleave cell wall oligo- or polysaccharides. 

Hydrolysis products can be produced by the use of the polypeptide of the invention 
and optionally purified according to methods described in the prior art. However, 
once the identity of the products is known by way of having applied the above- 
described method(s) for identifying a substrate compound of the polypeptide of the 
invention, the products may likewise be provided by other methods than by using the 
polypeptide of the invention which are known in the art such as by isolation from 
natural sources or by chemical synthesis. 

In a preferred embodiment of the method for identifying substrate compounds, 
mutant plants are used, wherein the gene encoding the polypeptide of the invention 
is inactive such as in the pen2 mutant described in connection with the present 
invention. 

For the purpose of testing whether a defence response against a pathogen is 
induced or enhanced, test plants or parts thereof are inoculated with a corresponding 
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pathogen in the presence and in the absence of the compound or a hydrolysis 
product thereof. Instead of inoculation, the plant or part thereof may alternatively be 
treated with an elicitor if appropriate. Susceptibility to the pathogen may then be 
examined according to methods described in the prior art or as exemplified in the 
appended Examples. A reduction of susceptibility due to the presence of the 
compound or a hydrolysis product thereof is indicative for the ability of said 
compound or hydrolysis product thereof to induce or enhance a defence response 
against a pathogen in a plant. The term "inducing a defence response" refers to the 
situation where the plant does normally not show a significant defence reaction to the 
respective pathogen, for instance when a compatible interaction takes place with the 
pathogen. On the other hand, "enhancing a defence response" refers to the situation 
where the plant normally shows a defence response, e.g. in an incompatible 
interaction with the pathogen or as part of a non-host resistance, but the presence of 
the compound or hydrolysis product thereof results in an enhancement of the 
defence reaction(s) with the effect that the plant is less susceptible to the pathogen. 
"Less susceptible" may mean in this context a reduction of successful colonization of 
the pathogen into plant tissue by at least 10%, preferably by at least 20%, more 
preferably by at least 50% and even more preferably by at least 80%. The term 
"successful colonisation" refers to colonization of the pathogen on or in the plant 
which are stable, i.e. in contrast to colonization which cease after a certain while due 
to lacking nutrition for the pathogen. With regard to fungal pathogens, like for 
instance powdery mildew, such a successful colonisation is for instance apparent 
from the formation of a haustorium and of secondary hyphae. 

In yet another embodiment, the present invention relates to a method for preparing a 
plant protection composition comprising the steps of at least one of the methods for 
identifying a compound that is hydrolyzed by a polypeptide of the invention described 
above and furthermore the step of formulating the identified compound or a 
hydrolysis product thereof preferably being capable of inducing or enhancing a 
defense response against a pathogen in a plant in a form suitable for administering to 
plants. 

The compound identified according to the above-described method(s) or an analog 
or derivative thereof may be further formulated in a form suitable for the application in 
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plant cultivation. For example, it can be combined with a agriculturally acceptable 
carrier known in the art. The plant protection composition can be prepared by 
employing one or more of the above-described methods for identifying the compound 
and synthesizing the compound or a hydrolysis product thereof in an amount 
sufficient for use in agriculture. Thus, the term "formulating" also encompasses 
synthesizing the compound so-identified or a hydrolysis product thereof or an analog 
or derivative thereof. "Analogs or derivatives" refer to compounds that show 
substantially the same activity with respect to the potential to increase resistance in 
plants as the originally identified compound or a hydrolysis product thereof aind that 
are immediately recognizable by a person skilled in the art in the field of 
agrochemicals once he/she is aware of the originally identified compound or a 
hydrolysis product thereof. 

In the plant protection composition of the invention, the compound identified by the 
above-described method or a hydrolysis product thereof may be formulated by 
conventional means commonly used for the application of, for example, herbicides 
and pesticides or agents capable of inducing systemic acquired resistance (SAR). 
For example, certain additives known to those skilled in the art such as stabilizers or 
substances which facilitate the uptake by the plant cell, plant tissue or plant may be 
used as for example harpins, elicitins, salicylic acid (SA), benzol(1,2,3)thiadiazole-7- 
carbothioic acid (BTH), 2,6-dichloro isonicotinic acid (INA), jasmonic acid (JA) or 
methyljasmonate. 

Accordingly, the present invention furthermore relates to a compound, a hydrolysis 
product thereof or a plant protection composition that is identified or obtained by any 
one of the corresponding methods described above. 

In addition, the present invention relates to a kit comprising the polynucleotide, the 
recombinant nucleic acid molecule, the vector, the polypeptide or the antibody of the 
invention or the compound that is hydrolyzed by the polypeptide of the invention or a 
hydrolysis product thereof. 

The kit of the invention may contain further ingredients such as selection markers 
and components for selective media suitable for the generation of transgenic plant 
cells, plant tissue or plants. Furthermore, the kit may include materials such as 
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buffers and candidate substrates for carrying out one of the above-described 
methods for identifying compounds that are hydrolyzed by the polypeptide of the 
invention. Thus, the kit may for example contain nucleic acid molecules useful as 
probes for identifying an ortholog of the Arabidopsis beta-glucosidase gene disclosed 
herein in other plant species. Furthermore, the kit may contain materials useful for 
expressing the polypeptide and/or for crystallizing it. The kit of the invention may 
advantageously be used for carrying out the methods of the invention and could be, 
inter alia, employed in a variety of applications referred to herein, e.g., as research 
tool. The parts of the kit of the invention can be packaged individually in vials or in 
combination in containers or multicontainer units. Manufacture of the kit follows 
preferably standard procedures which are known to the person skilled in the art. 
The kit or its ingredients according to the invention can be used, for example, for any 
of the above described methods for identifying compounds that enhance or establish 
non-host resistance in plants. The kit of the invention and its ingredients are 
expected to be very useful in breeding new varieties of, for example, plants which 
display improved properties such as disease resistance. 

It is also immediately evident to the person skilled in the art that the polynucleotides, 
recombinant nucleic acid molecules and vectors of the present invention can be 
employed to produce transgenic plants with a desired trait (see for review TIPTEC 
Plant Product & Crop Biotechnology 13 (1995), 312-397) comprising (i) insect 
resistance (Vaek, Plant Cell 5 (1987), 159-169), (ii) virus resistance (Powell, Science 
232 (1986), 738-743; Pappu, World Journal of Microbiology & Biotechnology 11 
(1995), 426-437; Lawson, Phytopathology 86 (1996), 56 suppl.), (iii) resistance to 
bacteria, insects and fungi (Dueling, Molecular Breeding 2 (1996), 297-305; 
Strittmatter, Bio/Technology 13 (1995), 1085-1089; Estruch, Nature Biotechnology 15 
(1997), 137-141), or (iv) as a genetic marker useful in breeding plants with an 
improved resistance to pathogens. 

In a further embodiment, the present invention relates to the use of the 
polynucleotide, of the recombinant nucleic acid molecule, of the vector, of the host 
cell, of the polypeptide, of the antibody or of the transgenic plant of the invention, 
said matters being described above in detail, for identifying a compound that is 
hydrolyzed by a polypeptide encoded by said polynucleotide. 
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For this use, preferably, any one of the above-mentioned methods for identifying 
such a compound may be applied. However, the present invention additionally 
encompasses the corresponding use of the matters given above by applying other 
methods that can be envisaged. 

Another embodiment of the present invention relates to the use of the polynucleotide, 
of the recombinant nucleic acid molecule, of the vector, of the host cell, of the 
polypeptide, of the antibody or of the transgenic plant of the invention, said matters 
being described above in detail, for the preparation of a plant protection composition. 

Also within the scope of the invention is the use of the of the polynucleotide, of the 
recombinant nucleic acid molecule, of the vector, of the host cell, of the polypeptide, 
of the antibody or of the transgenic plant of the invention, said matters being 
described above in detail, for establishing or enhancing a pathogen resistance in 
plants. 

These and other embodiments are disclosed and encompassed by the description 
and examples of the present invention. Further literature concerning any one of the 
methods, uses and compounds to be employed in accordance with the present 
invention may be retrieved from public libraries, using for example electronic devices. 
For example the public database "Medline" may be utilized which is available on the 
Internet, for example under http://www.ncbi.nlm.nih.gov/PubMed/medline.html. 
Further databases and addresses, such as http://www.ncbi.nlm.nih.gov/, 
http://www.infobiogen.fr/, http://www.fmi.ch/biology/research_tools.html, 
http://www.tigr.org/, are known to the person skilled in the art and can also be 
obtained using, e.g., http://www.google.de. An overview of patent information in 
biotechnology and a survey of relevant sources of patent information useful for 
retrospective searching and for current awareness is given in Berks, TIBTECH 12 
(1994), 352-364. 

Furthermore, the term "and/or" when occurring herein includes the meaning of "and", 
"or" and "all or any other combination of the elements connected by said term". 

The present invention is further described by reference to the following non-limiting 
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figures and examples. 
The Figures show: 

Figure 1 shows on a microscopic scale typical events during the non-host 
interaction between Arabidopsis thaliana and Blumeria graminis f. sp. 
hordei (Bgh). 

A. Bgh spores efficiently adhere, germinate and develop appressoria on 
Arabidopsis leaves. Most aborted penetration attempts correlate with a 
local cell wall remodeling (24 h p. inoc). 

B. Successful penetration and haustorium initial formation. Encasement 
in callose terminates the fungal development (48 h p. inoc). 

C. Thickening of the complete cell wall and typical whole cell 
autofluorescence in a successfully penetrated epidermis cell (72 h p. 
inoc). 

Figure 2 illustrates the mutant phenotype of pen 2 5 d p. inoc Fluorescence 
microscopy reveals areas of intense autofluorescence on the Bgh 
challenged pen2 plant at a low magnification (8 x), indicative of a 
significantly increased frequency (approx. 30 %) of single epidermal 
cells having undergone an hypersensitive response (HR), as revealed 
at a higher magnification (100 x). 

Figure 3 shows a comparative cytological time course analysis of single Bgh-Col- 
3fpen2 interaction sites. A significant higher penetration success is 
invariably ensued by a cell death response on pen2 plants. 



Figure 4 illustrates the course of pathogen growth (Bgh) on the epidermis of a 
pen2 mutant leaf. 

A. Mature haustorium formation with typical digit-like protrusions in a 
successfully penetrated pen2 epidermis cell. Fluorescence microscopy 
reveals an incomplete encapsulation (24 h p. inoc). 
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B. Finally all established haustoria become encased in a thick layer of 
callose and polyphenolic compounds (48 h p. inoc). 

C. Successful penetration invariably leads to a hypersensitive cell death 
response (72 h p. inoc). 

Figure 5 proves that pen2 mutant plants are penetrated by Phytophthora 
infestans to a significantly higher extent than corresponding wild-type 
plants. 

A On Col-3 plants the oomycete Phythophthora infestans exhibits only 

limited superficial growth and almost no successful penetration. 

B On pen2 mutant plants a significantly enhanced penetration success 

is observed. Fungal infection structures become encased in cell wall like 

material. 

Figure 6 gives a comparative cytological analysis of Col-3 and pen2 challenged 
with different avirulent fungal pathogens (72 h p. inoc). Challenge with 
the hemibiotrophic rice pathogen P. grisea gives rise to a significantly 
higher frequency of cell wall remodeling in per?2 plants. 
Wheat powdery mildew inoculation results in enhanced penetration 
success and an increased cell death rate in pen2 plants. 

Figure 7 depicts a comparative FTIR spectrum plot showing relative absorbance 
changes in cell walls of Col-3 and pen2 plants. Significant relative 
absorbance differences between wildtype Col-3 and mutant pen2 are 
prominent in the range of carbohydrate and phenolics fingerprints. 

Figure 8 gives a schematic overview of the chromosomal region harboring the 
gene encoding pen2. SSLP and CAPS marker assisted map based 
cloning resulted in an 18 kb fragment of BAC F4I1 , flanked by CEREON 
data based CAPS markers SNP1 and SNP4, containing two predicted 
genes, both coding for putative family 1 glycohydrolases of which At2g 
44490 showed to encode PEN2. 
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Figure 9 presents the nucleotide and deduced amino acid sequence of the PEN2 
cDNA. 

The glycosyl hydrolase family 1 N-terminal signature is boxed in dark 
gray and the putative C-terminal transmembrane domain boxed in light 
gray. The G/A144 transition site in the mutant pen2 is underlined. The 
two catalytic glutamates are boxed in blue, highly conserved residues 
involved in glucose binding in green and residues involved in 
determining substrate specificity in red. 

Figure 10 shows a phylogenetic analysis of all 47 Arabidopsis family 1 glycoside 
hydrolases. It assigns PEN2 (At2g44490/F4l 1 .30) to a cluster of 11 
proteins, some of which might potentially be involved in plant defense 
mechanisms and/or senescence. 



Figure 11 depicts the plasmid map of the binary plant expression vector pamMCS 
for use in plant transformation. 

Figure 12 shows a sequence alignment of the PEN2 amino acid sequence with 
amino acid sequences from related family 1 glycoside hydrolases, 
showing residues relevant for catalysis and substrate specificity. 
ATHcab75923, closest Arabidopsis homologue of PEN2; ZMGIul, 
maize (Zea mays L. ) Glu1; ZMGIu2, maize Glu2; SBDhrl, sorghum 
(sorghum bicolor) Dhr1; SAMyr, white mustard (sinapis alba) 
myrosinase. The alignment shows the two catalytic glutamates (*), 
highly conserved residues involved in glucose binding (+) and residues 
involved in aglycdne binding (#). The N-terminal signature of family 1 |3- 
glycosidases is boxed. Regions of significant sequence similarity are 
shaded. 



The following Examples serve to further illustrate the invention. 
In the Examples the following materials and methods were used. 
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Molecular biological techniques 

Unless stated otherwise in the Examples, all recombinant DNA techniques are 
performed according to protocols as described in Sambrook and Russell 
(2001), Molecular Cloning: A Laboratory Manual, CSH Press, Cold Spring 
Harbor, NY, USA or in Volumes 1 and 2 of Ausubel et al. (1994), Current 
Protocols in Molecular Biology, Current Protocols. Standard materials and 
methods for plant molecular work are described in Plant Molecular Biology 
Labfase (1993) by R.D.D. Cray, jointly published by BIOS Scientific 
Publications Ltd (UK) and Blackwell Scientific Publications (UK). 

Plant and fungal material and growth conditions 

An EMS mutated M2 population of Col-3, gl1 purchased from Lehle Seeds 

(LEHLE SEEDS 1102 South Industrial Blvd. Suite D Round Rock TX 78681 

USA; Cat.-# M2E-01 A-05) was used for the mutant screen. 

Plants were grown in Percival chambers (AR75L3) at 22 °C and 70 % humidity 

with a 16 h photoperiod. Light intensity was > 250 pE/m2 per sec for all 

experiments. 

Blumeria graminis f.sp hordei K1 was maintained on barley (Hordeum vulgare) 
line Sultan-5. 

Mutant screen 

3-4 week old Arabidopsis plants were inoculated by placing 16 pots (9x9 cm) 
each containing 16 plants into a settling tower and tapping six heavily infected 
barley pots over the top of the tower. After 5 min, the pots were returned to the 
Percival growth chambers. 5 days post inoculation plants were screened with 
a Leica stereomicroscope MZFLIII with a GFP1 filter set for aberrant enhanced 
autofluorescence patterns indicative of enhanced cell death of epidermal cells 
due to successful fungal ingress. 
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Example 1: 

Cytological studies on the interaction between Arabidopsis and the non-host 

pathogen Blumeria graminis (Bgh) 

As a first step, plant/fungus interaction was analyzed at the cytological level. Bgh 
spore inoculation of Arabidopsis ecotype Col-3 (gl1) plants showed complete 
immunity and lack of macroscopically visible plant responses even at late time points 
after inoculation (seven days after inoculation). Light microscopic analysis revealed 
successful germination of fungal spores on the leaf surface, efficient differentiation of 
Bgh germ tubes and appressorium formation, i.e. infection structures known to be 
critical for penetration of the fungus through the plant cell wall. Unlike compatible 
interactions on barley leaves, most sporelings failed to successfully penetrate 
epidermal cell walls at sites of attempted ingress (>98%). At failed penetration sites, 
host cells remained alive and typically responded to attempted fungal ingress by the 
formation of round shaped cell wall appositions, containing autofluorescent 
compounds, directly beneath a fungal appressorium. However, the remaining 
sporelings (<2%) successfully penetrated the cell wall beneath appressoria, indicated 
by the presence of haustorial initials within single epidermal cells (Fig. 1). Haustorium 
maturation ceased in most cases prematurely and haustorium containing host 
epidermal cells invariably died within 72 hours after inoculation as indicated by a 
whole cell yellow autofluorescence under UV light irradiation (Fig. 1). This cell death 
response may reflect activation of plant defense as a consequence of invasive fungal 
growth. Since both localized cell wall apposition formation and epidermal cell death 
are also implicated in Bgh defense following attack of its natural host, barley, it may 
be concluded that the nonhost Arabidopsis is capable to trigger similar cellular 
defense reactions. It was speculated that the low incidence of fungal cell wall 
penetration into Arabidopsis epidermal cells and the coincident formation of cell wall 
appositions at failed penetration sites might reflect the presence of effective 
(prehaustorial) nonhost resistance in Col-3 {gl1) wild type plants, preventing a switch 
from surface to invasive fungal growth. Taken together, non-host resistance of 
Arabidopsis to Bgh appears to be a predominantly prehaustorial, active defense 
response manifested at the level of cell wall prenetration. 
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Example 2: 

Isolation of Arabidopsis mutants with aberrant infection phenotypes to Bgh 



The information obtained from failed infection attempts of fungal sporelings on wild 
type Arabidopsis (see Example 1) could be used to devise a fluorescence-based 
screening procedure that uses plant autofluorescence as read out to identify 
Arabidopsis mutants exhibiting aberrant infection phenotypes. One could reason that 
genetic defects in prehaustorial nonhost defense would lead to an increased 
haustorium index. Since intracellular growth of the nonhost fungus triggers host cell 
death (see above), enhanced epidermal cell death rates in prehaustorial defense 
mutants may be expected. Autofluorescence-associated cell death of epidermal cells 
can be easily detected by irradiating spore-inoculated Arabidopsis plants with UV- 
light and examining leaves using a fluorescence stereomicroscope. Col-3 (gl1) wild 
type plants show even after high density spore inoculation only occasional yellow 
autofluorescence in the epidermal cell layer at five days after inoculation (Fig. 2). 
This stereomicroscope-based assay was then used to screen 26,260 M2 plants of an 
EMS mutagenized Col-3 (gl1) population for mutants exhibiting a higher index of 
epidermal whole cell autofluorescence compared to wild type plants. Amongst 
several mutant candidates, a heritable mutant could be isolated which has been 
designated pen2 (Fig. 2). Quantitative cytological analysis of single interaction sites 
revealed in pen2 mutants about 15-fold enhanced penetration frequencies compared 
to Col-3 plants, thereby increasing the haustorial index from 2 to approximately 30% 
(Fig. 3). The cytological analysis frequently showed fully differentiated Bgh haustoria 
within penetrated pen2 epidermal cells 24 h after spore inoculation, which became 
normally encapsulated by cell wall-like material at -48 h. Similar to Col-3 wild type 
plants, a cell death response ensued in haustorium containing pen2 epidermal cells 
between 24 and 48 h post inoculation (Fig. 3 and 4). 



+ 
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Example 3: 

pen2 plants reveal compromised nonhost resistance to different fungal 

pathogens 

To find out whether pen2 plants were compromised in defense to other nonhost 
pathogens, wild type and pen2 plants were challenged with Pyricularia grisea, 
Blumeria graminis f sp tritici {Bgt, wheat powdery mildew), and Phytophthora 
infestans. In either of the tested plant fungus interactions, evidence was found for 
compromised nonhost resistance in pen2 plants (Fig. 5 and 6). For example, 
challenge with Bgt or P. infestans showed at interaction sites on pen2 plants 
strikingly enhanced penetration frequencies that are accompanied by elevated cell 
death frequencies. This is remarkable because P. infestans is an oomycete and Bgt 
and Bgh are ascomycete fungi, indicating impaired nonhost resistance in pen2 to 
different classes of pathogens. Interestingly, Pyricularia grisea failed to trigger any 
microscopically visible cellular responses in Col-3 plants despite its ability to 
differentiate melanized appressoria for cell wall penetration on the leaf surface. In 
pen2 plants, in about 10% of the interaction sites, cell wall apposition formation was 
observed which is indistinguishable from the round shaped wall appositions seen in 
response to attempted Bgh penetration. Likewise, the frequency of wall apposition 
formation in response to Bgt challenge is dramatically enhanced in pen2 plants 
compared to Cdl-3 (63% and 9% respectively). Collectively, these data point to a 
dual role of PEN2 in static and inducible defense to various non-host fungal 
pathogens. Dependent on the particular non-host pathogen, lack of PEN2 can 
enhance the frequency of inducible cellular defense (wall apposition formation) as 
consequence of an impaired static barrier and/or increase the incidence of failed 
inducible defense coincident with host cell wall penetration. 

Example 4: 

FTIR analysis suggests a role for PEN2 in cell wall architecture 

In view of the enhanced cell wall penetration frequencies of several non-host fungal 
pathogens on pen2 plants, it was interesting to investigate a potential role of PEN2 in 
cell wall architecture. Fourier transformed infrared spectroscopy (FTIR) of non- 
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infected wild type and pen2 leaf samples (Fig. 7) was performed. FTIR is a rapid and 
non-invasive spectroscopic method that can quantitatively detect a range of chemical 
groups and bonds in fixed plant specimens. The difference FTIR spectrum between 
wild type and pen2 leaves indicates significant constitutive changes in cell wall 
architecture of pen2 plants. This is consistent with the interpretation that pen2 plants 
are impaired in static pathogen defense that may be critical during attempted host 
cell wall penetration. 

Example 5: 
Map-based isolation of PEN2 

) 

An F2 mapping population was generated by crosses between pen2 and Landsberg 
(La) wild-type plants. Bgh inoculation experiments of individual F2 plants showed that 
pen2 segregated as monogenic recessive trait. DNA marker analysis of bulked F2 
segregants suggested that pen2 maps close to the telomere on the short arm of 
chromosome 2. By using CAPS and SSLP-type DNA markers, pen2 could be 
located in a 7.4 cM target interval bordered by markers F18019 and F4I18 (see 
Figure 8). Molecular analysis of 864 F2 individuals from the mapping population 
identified a total of 24 recombinants within the 7.4 cM target interval. Scoring of 
infection phenotypes of the 24 recombinant plants upon Bgh challenge enabled 
further delimitation of pen2 to an interval of approximately 18 kb harboring two 
predicted ORFs (At2g44480 and At2g44490; see GenBank/EMBL database entry 
No. AC004521). Both deduced gene products show significant homology to family 1 
glycosyl hydrolases. RT-PCR of leaf RNA samples resulted in cDNA transcripts for 
At2g44490 only. Sequencing of pen2 derived At2g44490 cDNAs (and genomic 
sequences) showed in comparison to wild type cDNAs a single G to A transition at 
position 144, generating a stop codon and a predicted truncated peptide of 47 aa. No 
sequence polymorphism was found upon comparison of At2g44480 genomic 
sequences derived from wild type and pen2 mutant DNA. Collectively, these data 
made it possible to identify the pen2 gene (At2g44490) within the genetically and 
physically delimited target interval. The RT-PCR derived PEN2 cDNA sequence and 
the deduced PEN2 protein sequence are shown in Fig. 9. 
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Glycoside hydrolases catalyze the hydrolysis of glycosidic bonds in oligosaccharides, 
polysaccharides and conjugates between glucosides and a non-carbohydrate moiety. 
They occur in all living organisms and are classified into 82 families (http://afmb.cnrs- 
mrs.fr/-cazv/CAZY/ ; Henrissat, Biochem. J. 316 (1996), 695-696). Due to the 
presence of a specific motif (family 1 N-terminal signature; see Figure 9), PEN2 is 
assigned to family 1 glycoside hydrolases. Known enzymatic activities of family 1 
enzymes range from p-glucosidase, p-galactosidase, 6-phopho-p-galactosidase, 6- 
phospho-P-glucosidase, lactase-phlorizin hydrolase, (3-mannosidase, and myrosinase 
activities. A total of 282 different family 1 glycoside hydrolase proteins are at present 
classified in this group, including 106 plant sequences of which 47 are Arabidopsis 
thaliana sequences. PEN2 shows highest homology to p-glucosidases that 
constitute a major group among family 1 glycoside hydrolases. These enzymes 
hydrolyze either O-linked or S-linked p-glucosidic bonds. Substrates for these 
proteins are extraordinary diverse and include hormone glucosides, flavonol 
glucosides, cyanogenic glucosides, and glucosinolates. The aglycon product of the 
hydrolyzed substrates can serve a multitude of functions including growth and 
development (Selmar et al., 1987; Brzobohaty et al., 1993; Dietz et al., 2000), cell 
wall catabolism (Leah et al., 1995; Gerardi et al., 2001), lignification 
(Dharmawardhana et al., 1995 and 1999), and defense (Zheng and Poulton, 1995; 
Rsak et al., 2000). 

A phylogenetic analysis using Clustal W algorithm of all known 47 Arabidopsis p- 
glucosidases identifies several major branches and some outgroups (Figure 10). 
Interestingly, PEN2 outgroups together with another family member (T2O9.100) in a 
major branch comprising a total of 11 p-glucosidases (Fig. 10). The fact that the 
closest relative shares only 55.2% sequence identity (and 70.5% similarity) with 
PEN2 underlines the unusual sequence divergence of p-glucosidases within a 
species. 

Example 6: 

Overexpressing PEN2 in transgenic Arabidopsis plants 

Transgenic Arabidopsis thaliana PEN2 overexpressing lines are generated by fusing 
the 35S promoter upstream of the PEN2 coding region, followed by pA35S terminator 
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sequences. For this purpose the binary plasmid DNA vector pamMCS is used (Fig. 
11). 

Agrobacterium-mediated flower dip transformation and kanamycin selection is used 
to identify transgenic lines (Clough, Plant Journal. 16 (1998), 735-743). Immunoblots 
from tissue extracts of individual transgenic lines are probed with a PEN2 specific 
antiserum to select overexpressing PEN2 lines. The transgenic plants will be tested 
for enhanced disease resistance to several pathogens, including Blumeria graminis, 
Erysiphe cichoracearum, Peronospora parasitica, Pseudomonas syringae, Fusarium 
oxysporum, Botrytis cinerea, and Phytophthora infestans. Similar experiments will be 
conducted to generate and isolate PEN2 overexpressing lines in closely related plant 
species (e.g. Brassica) or more distantly related species including tomato and potato 
and will be tested for enhanced disease resistant phenotypes. 
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CLAIMS 

1 . A polynucleotide selected from the group consisting of 

(a) polynucleotides comprising a nucleotide sequence encoding a 
polypeptide with the amino acid sequence of SEQ ID NO:2; 

(b) polynucleotides comprising the nucleotide sequence shown in SEQ ID 
NO:1; 

(c) polynucleotides comprising a nucleotide sequence encoding a fragment 
of the polypeptide encoded by a polynucleotide of (a) or (b) f wherein 
said nucleotide sequence encodes a protein having (3-glucosidase 
activity; 

(d) polynucleotides comprising a nucleotide sequence the complementary 
strand of which hybridizes to the polynucleotide of any one of (a) to (c), 
wherein said nucleotide sequence encodes a protein having p- 
glucosidase activity; and 

(e) polynucleotides comprising a nucleotide sequence that deviates from 
the nucleotide sequence defined in (d) by the degeneracy of the genetic 
code; 

2. The polynucleotide of claim 1 which is DNA or RNA. 

3. A recombinant nucleic acid molecule comprising the polynucleotide of claim 1 
or 2. 

4. The recombinant nucleic acid molecule of claim 3 further comprising 
expression control sequences operably linked to said polynucleotide. 

5. A vector comprising a polynucleotide of claim 1 or 2 or the recombinant nucleic 
acid molecule of claim 3 or 4. 

6. The vector of claim 5 further comprising expression control sequences 
operably linked to said polynucleotide. 
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7. A method for producing genetically engineered host cells comprising 
introducing the polynucleotide of claim 1 or 2, the recombinant nucleic acid 
molecule of claim 3 or 4 or the vector of claim 5 or 6 into a host cell. 



8. A host cell which is genetically engineered with the polynucleotide of claim 1 or 
2, the recombinant nucleic acid molecule of claim 3 or 4 or the vector of claim 
5 or 6 or obtainable by the method of claim 7. 

9. The host cell of claim 8 which is a bacterial, yeast, fungus, plant or animal cell. 

10. A method for the production of a polypeptide encoded by a polynucleotide of 
claim 1 or 2 in which the host cell of claim 8 or 9 is cultivated under conditions 
allowing for the expression of the polypeptide and in which the polypeptide is 
isolated from the cells and/or the culture medium. 

11. A polypeptide encoded by the polynucleotide of claim 1 or 2 or obtainable by 
the method of claim 10. 

12. An antibody specifically recognizing the polypeptide of claim 1 1 . 

13. A method for producing a transgenic plant comprising the steps of 

(a) introducing the polynucleotide of claim 1 or 2, the recombinant nucleic 
acid molecule of claim 3 or 4 or the vector of claim 5 or 6 into the 
genome of a plant cell; and 

(b) regenerating the cell of (a) to a transgenic plant. 



A transgenic plant or plant tissue comprising the plant cells of claim 9 or 
obtainable by the method of claim 13. 

A transgenic plant which shows an increased activity of the polypeptide 
encoded by the polynucleotide of claim 1 or 2 compared to a corresponding 
wild-type plant. 
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16. The transgenic plant of claim 14 or 15 which, upon an increased activity of the 
protein encoded by the polynucleotide of claim 1 or 2, shows an increased 
resistance against a plant pathogen to which a corresponding wild-type plant is 
susceptible. 

17. Propagation material or harvestable parts of the transgenic plant of any one of 
claims 14 to 16 comprising plant cells of claim 9. 

18. A method for conferring pathogen resistance or increased pathogen resistance 
to a plant comprising the step of providing a transgenic plant in which the 
activity of the polypeptide encoded by the polynucleotide of claim 1 or 2 is 
increased. 

19. A method for identifying a compound that is hydrolyzed by a polypeptide 
encoded by the polynucleotide of claim 1 or 2 comprising the steps of 

(a) contacting a candidate compound with said polypeptide under 
conditions where said polypeptide is active; and 

(b) determining whether said candidate compound is hydrolyzed by said 
polypeptide. 

20. A method for identifying a compound that is hydrolyzed by a polypeptide 
encoded by the polynucleotide of claim 1 or 2 comprising the steps of 

(a) providing a 3-dimensional structure model of said polypeptide; and 

(b) determining the structure of a substrate that fits into the 3-dimensional 
structure model of (a). 

21. A method for identifying a compound that is hydrolyzed by a polypeptide 
encoded by the polynucleotide of claim 1 or 2 comprising the steps of 

(a) providing a mutant protein of said polypeptide the catalytic activity of 
which is abolished without loosing substrate binding activity; 

(b) contacting a candidate compound with said mutant protein; and 

(c) determining whether the candidate compound is bound by said mutant 
protein. 
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22. The method of any one of claims 19 to 21 furthermore comprising the step of 
determining whether the identified compound or a hydrolysis product thereof is 
capable of inducing or enhancing a defence response against a pathogen in a 
plant. 

23. A method for preparing a plant protection composition comprising the steps of 
the method of claims 19, 20, 21 and/or 22 and furthermore the step of 
formulating the identified compound or a hydrolysis product thereof in a form 
suitable for administering to plants. 

24. A compound, a hydrolysis product thereof or a plant protection composition 
identified or obtained by the method of any one of claims 18 to 23. 

25. A kit comprising the polynucleotide of claim 1 or 2, the recombinant nucleic 
acid molecule of claim 3 or 4, the vector of claim 5 or 6, the polypeptide of 
claim 11, the antibody of claim 12 or the compound or a hydrolysis product 
thereof of claim 24. 

26. Use of the polynucleotide of claim 1 or 2, of the recombinant nucleic acid 
molecule of claim 3 or 4, of the vector of claim 5 or 6, of the host cell of claim 8 
or 9, of the polypeptide of claim 11, of the antibody of claim 12 or of the 
transgenic plant of any one of claims 14 to 17 for identifying a compound that 
is hydrolyzed by a polypeptide encoded by said polynucleotide. 

27. Use of the polynucleotide of claim 1 or 2, of the recombinant nucleic acid 
molecule of claim 3 or 4, of the vector of claim 5 or 6, of the host cell of claim 8 
or 9 or of the polypeptide of claim 11, of the antibody of claim 12 or of the 
transgenic plant of any one of claims 14 to 17 for the preparation of a plant 
protection composition. 

28. Use of the polynucleotide of claim 1 or 2, of the recombinant nucleic acid 
molecule of claim 3 or 4, of the vector of claim 5 or 6, of the host cell of claim 8 
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or 9 or of the polypeptide of claim 11, of the antibody of claim 12 or of the 
transgenic plant of any one of claims 14 to 17 for establishing or enhancing a 
pathogen resistance in plants. 



* 
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Abstract 

Disclosed are polynucleotides that encode a beta-glucosidase belonging to the plant 
protein class of family 1 glycoside hydrolases. Furthermore described are 
recombinant nucleic molecules and vectors containing these polynucleotides and 
host cells, preferably plant cells, that are genetically engineered with these 
polynucleotides. In addition, transgenic plants are disclosed which show an 
increased activity of the beta-glucosidase described, preferably leading to an 
improved resistance against plant pathogens. In another aspect, the present 
application discloses screening methods for identifying a substrate for said beta- 
glucosidase and methods for the production of plant protection compositions from 
such substrates or hydrolysis products thereof. Furthermore, methods and uses are 
disclosed which apply the polynucleotides encoding the beta-glucosidase described 
in order to improve resistance in plants against plant pathogens. 
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MAHLQRTFPTEMSKGRASFPK 21 
A TG GCA CA TCT TC AAAGA ACA TTTCCTA CT GAG ATGTCAA AAGGT AG AGC TA GC TTT CC AAA A 63 



G mmmmmmmmmmmm^mmzm « m v » B G a 42 

G GC TTT CT CTT TG GAACT GCT TC AT CTT CT TAT CAGT ACG AA GGA GC AGTGA ATGAAGG TGC G 126 

RGQ5 VWDH FSNRPPHRI SDSS 63 

A G A GGA CA AAG CG TGT G£G AT CA TT TCT CC AAC AG GT TTC CT CAC AG AATCA GT GAT TC TAG C 189 

DGNV AVDF Y H R Y KEDIK RMKD 84 

G AC GGA AACGT TGCCG TT GATTTCT ACC AT CGTTA CA AGG AA GAT AT TAAGA GAATGAAGGA T 252 

INMDSFRL SIAWPRVLPYGKR 105 

ATA AAC AT GGA TTCGT TTCGGCTTT CCA TT GCT TG GC CAC GA GTTCT ACC TT ATGGC AA AAG G 315 

DRGVSEEG IKFYNDVIDELLA 126 

G AT AG A GG AGT T A GT G AA GAA GG AA TTA AG TTT TA CA ATG AT GTT AT TGA TG AA CTC TT AGC C 3 78 

NEIT PLVT IFHWDIPQDLEDE 147 

A AT GAA AT CAC TC CTC TT GTT AC TA TCT TTCAT TG GG ACA TA CCA CA GGA TC TT G AA GA TGA A 4 41 

YGGF L S E Q IIDDFR DYA SLCF 168 

T AC GGC GG TTT TC TAA GC GAGCAGA TTA TAGAT GA CT TCAGAGAC TA TGC GAGTCTC TG CTT C 504 

E R FG DRVS LWCTMnHpWVYSV 189 

G AG AGA TT TGG GG ACAGA GTG AG TC TGTGG TGC AC AA TGA AT GAG CC GTG GG TC TAC AG TGT C 567 

■ G Y D T G R B APGRCSKYVNGAS 210 

G CG GGC TA TGA C A CAG GA AGG AA AG CGC CAGG A CG GT GCT CC AAG TA TGT TA AT GGC GC TAG T 6 30 

VAGMSGYE AY IVSHNMLLAHA 2 31 

G TT GCT GG AATGTCGG GA TAC GAGG CAT AT ATT GTGAGCC AT AAC AT GCT TC TAGCG CA CGC A 693 

EAVEVFRK CDHI KNGQI GIAK 2 52 

G AAGCAGT GGA AG TGT TT AGA AA AT GTG AC CAT AT TA AAA AC GGACA AAT TG GG ATT GC GCA T 7 56 

NPLW YEPY DPSD PDDVEGCNR 273 

A AT CC A CT TTG G T ACG AG CCA TA TG ATC CG AGT GATC CAG AT GAT GT CG A AG GA TGT AA TCG A 819 

AMDF MLGW HQH PTACGD Y ? E T 294 

G CT ATG GA CTT C A TGC TT GGT TGGC ATC AG CAT CC GACTG CT TGT GG AG ACT AT CCA GA AAC G 882 

MKK5 VGDRLPS FTPEQS KKLI 315 

ATG AAG AA ATC AG TTG GAGAT AG AT TAC CG AGT TT TACAC CA GAA CA ATC TAAGAAACT TAT A 945 

GSCDYVGI NYYSSLFVKSIKH 336 

G GC TCT TG CGA T T ACG TT GGT AT AA ACT AC TAT AG CT CGC TT TTC GT GAAGA GT ATC AA ACA C 10 08 

VD PT QPTW RTDQGVDWMKTNI 3 57 

G TG GAT CC TAC GC AAC CT ACT TG GA GAA CT GAC CAAGGCG TT GAT TG GATGA AA ACC AA CAT A 10 7 1 

DGKQIAKQGGSE Bl SFTYPTGL 378 

G AT GGG AA ACA AA TAG CA AAACA AG GAG GA TCAGAGT GGAGT TTC AC ATATC CAACAGG ACT C 11 3 4 

R.N I L KYVK KTYG NPPIL ItHn 399 

AGA AAC AT TTT GA AGT AT GTG AA AA AAACT TAT GG CA ATC CT CCC AT TCTCA TA ACT GA AAA C 1197 

GYGE VAEQ SQSLYMYN P SIDT 4 20 

G GG TAT GG TGA AG TAG CG GAACAGAGTC AG AGT CT TT ATATG TAC AATCCTT CA ATC GA CAC A 12 60 

ERLE YIEG HI HA IHQ AI HEDG 441 

GAG AGA TT GGA G T ACA TT GAA GG AC ATA TC CAC GC TA TTC AT CAA GC CAT CC AT GAA G A TGG A 13 23 

VRVEGYYV Hi S L L D N F S G Y 4 52 

G TAAGA GT GGA AG GTT AT TAC GT AT GGT CA TTG CTAG ATA ACTTCGAGTGGA AC AGT GG ATA T 1386 

GVRYGLYY IDYKDGLRRYPKH 4 83 

G GT GTG AG ATA TG GT T TA TAT TA CA TTG AT TAC AA AG ATG GG CTT AG ACG AT ACCCG AA AAT G 14 49 

SALWLXEF LRFDQEDDS STSK 504 

TCG GCG TT ATG G T TG A AA GAG TT CT TGA GG TTT GA TC AAG AA G AC GA TTC TT CG ACG TC TAA G 1512 

KEEK KESYGKQLLHSVQDSQF 525 

A AAGAA GA GAAGA AAG AG AGC TA TG GAA AA CAG TT AT TGC AT TCTGT TCA GG AC AGT CA ATT T 1575 



VHSIKDs[g A* L P~ >"VT tt S L , F V V 'si 546 
G TTCAT TC GAT T A AAG AC AGTGG TG CGT TACCT GC GG TTT TG GGG AG CTTGT TC GTT GTGTC T 16 38 



l-at^vH n T-SLFFKGANN+ 560 
G CA ACT GTTGGTACTT CT CTG TT CT TCA AA GGA GC TA ATAAT TGA 1683 
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SEQUENCE LISTING 

<110> Max-Planck-Gesellschaf t zur Forderung der Wissenschaf ten e.V. 
<120> Polynucleotides encoding a beta-glucosidase and uses thereof 
<130> G1309PCT 
<160> 4 

<170> Patentln version 3.1 

<210> 1 

<211> 1683 

<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> CDS 

<222> (1)..(1680) 

<223> 



<400> 1 

atg gca cat ctt caa aga aca ttt cct act gag atg tea aaa ggt aga 48 

Met Ala His Leu Gin Arg Thr Phe Pro Thr Glu Met Ser Lys Gly Arg 

15 10 15 



get age ttt cca aaa ggc ttt etc ttt gga act get tea tct tct tat 
Ala Ser Phe Pro Lys Gly Phe Leu Phe Gly Thr Ala Ser Ser Ser Tyr 
20 25 30 



gat cat ttc tec aac agg ttt cct cac aga ate agt gat tct age gac 
Asp His Phe Ser Asn Arg Phe Pro His Arg lie Ser Asp Ser Ser Asp 
5 0 55 60 



96 



cag tac gaa gga gca gtg aat gaa ggt gcg aga gga caa age gtg tgg 144 
Gin Tyr Glu Gly Ala Val Asn Glu Gly Ala Arg Gly Gin Ser Val Trp 
35 40 45 



192 



2/7 



gga aac gtt gcc gtt gat ttc tac cat cgt tac aag gaa gat att aag 240 
Gly Asn Val Ala Val Asp Phe Tyr His Arg Tyr Lys Glu Asp lie Lys 
65 70 75 80 

aga atg aag gat ata aac atg gat teg ttt egg ctt tec att get tgg 288 
Arg Met Lys Asp lie Asn Met Asp Ser Phe Arg Leu Ser lie Ala Trp 
85 90 95 

cca cga gtt eta cct tat ggc aaa agg gat aga gga gtt agt gaa gaa 33 6 

Pro Arg Val Leu Pro Tyr Gly Lys Arg Asp Arg Gly Val Ser Glu Glu 
100 105 110 

gga att aag ttt tac aat gat gtt att gat gaa etc tta gcc aat gaa 384 
Gly lie Lys Phe Tyr Asn Asp Val lie Asp Glu Leu Leu Ala Asn Glu 
115 120 125 

ate act cct ctt gtt act ate ttt cat tgg gac ata cca cag gat ctt 432 
lie Thr Pro Leu Val Thr lie Phe His Trp Asp lie Pro Gin Asp Leu 
130 135 140 

gaa gat gaa tac ggc ggt ttt eta age gag cag att ata gat gac ttc 480 
Glu Asp Glu Tyr Gly Gly Phe Leu Ser Glu Gin lie lie Asp Asp Phe 
145 150 155 160 

aga gac tat gcg agt etc tgc ttc gag aga ttt ggg gac aga gtg agt 52 8 

Arg Asp Tyr Ala Ser Leu Cys Phe Glu Arg Phe Gly Asp Arg Val Ser 
165 170 175 

ctg tgg tgc aca atg aat gag ccg tgg gtc tac agt gtc gcg ggc tat 57 6 

Leu Trp Cys Thr Met Asn Glu Pro Trp Val Tyr Ser Val Ala Gly Tyr 
180 185 190 

gac aca gga agg aaa gcg cca gga egg tgc tec aag tat gtt aat ggc 624 
Asp Thr Gly Arg Lys Ala Pro Gly Arg Cys Ser Lys Tyr Val Asn Gly 
195 200 205 

get agt gtt get gga atg teg gga tac gag gca tat att gtg age cat 672 
Ala Ser Val Ala Gly Met Ser Gly Tyr Glu Ala Tyr lie Val Ser His 
210 215 220 

aac atg ctt eta gcg cac gca gaa gca gtg gaa gtg ttt aga aaa tgt 72 0 

Asn Met Leu Leu Ala His Ala Glu Ala Val Glu Val Phe Arg Lys Cys 
225 230 235 240 

gac cat att aaa aac gga caa att ggg att gcg cat aat cca ctt tgg 768 
Asp His He Lys Asn Gly Gin He Gly He Ala His Asn Pro Leu Trp 
245 250 255 

tac gag cca tat gat ccg agt gat cca gat gat gtc gaa gga tgt aat 816 
Tyr Glu Pro Tyr Asp Pro Ser Asp Pro Asp Asp Val Glu Gly Cys Asn 
260 265 270 

cga get atg gac ttc atg ctt ggt tgg cat cag cat ccg act get tgt 864 
Arg Ala Met Asp Phe Met Leu Gly Trp His Gin His Pro Thr Ala Cys 
275 280 285 

gga gac tat cca gaa acg atg aag aaa tea gtt gga gat aga tta ccg 912 
Gly Asp Tyr Pro Glu Thr Met Lys Lys Ser Val Gly Asp Arg Leu Pro 
290 "* 295 300 
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agt ttt aca cca gaa caa tct aag aaa ctt ata ggc tct tgc gat tac 960 
Ser Phe Thr Pro Glu Gin Ser Lys Lys Leu lie Gly Ser Cys Asp Tyr 
305 310 315 320 

gtt ggt ata aac tac tat age teg ctt ttc gtg aag agt ate aaa cac 1008 
Val Gly lie Asn Tyr Tyr Ser Ser Leu Phe Val Lys Ser lie Lys His 
325 330 335 

gtg gat cct acg caa cct act tgg aga act gac caa ggc gtt gat tgg 1056 
Val Asp Pro Thr Gin Pro Thr Trp Arg Thr Asp Gin Gly Val Asp Trp 
340 345 350 

atg aaa acc aac ata gat ggg aaa caa ata gca aaa caa gga gga tea 1104 
Met Lys Thr Asn lie Asp Gly Lys Gin lie Ala Lys Gin Gly Gly Ser 
355 360 365 

gag tgg agt ttc aca tat cca aca gga etc aga aac att ttg aag tat 1152 
Glu Trp Ser Phe Thr Tyr Pro Thr Gly Leu Arg Asn lie Leu Lys Tyr 
370 375 380 

gtg aaa aaa act tat ggc aat cct ccc att etc ata act gaa aac ggg 1200 
Val Lys Lys Thr Tyr Gly Asn Pro Pro lie Leu lie Thr Glu Asn Gly 
385 390 395 400 

tat ggt gaa gta gcg gaa cag agt cag agt ctt tat atg tac aat cct 1248 
Tyr Gly Glu Val Ala Glu Gin Ser Gin Ser Leu Tyr Met Tyr Asn Pro 
405 410 415 

tea ate gac aca gag aga ttg gag tac att gaa gga cat ate cac get 129 6 

Ser lie Asp Thr Glu Arg Leu Glu Tyr lie Glu Gly His lie His Ala 
420 425 430 

att cat caa gee ate cat gaa gat gga gta aga gtg gaa ggt tat tac 1344 
He His Gin Ala He His Glu Asp Gly Val Arg Val Glu Gly Tyr Tyr 
435 440 445 

gta tgg tea ttg eta gat aac ttc gag tgg aac agt gga tat ggt gtg 1392 
Val Trp Ser Leu Leu Asp Asn Phe Glu Trp Asn Ser Gly Tyr Gly Val 
450 455 460 

aga tat ggt tta tat tac att gat tac aaa gat ggg ctt aga cga tac 1440 
Arg Tyr Gly Leu Tyr Tyr He Asp Tyr Lys Asp Gly Leu Arg Arg Tyr 
465 470 475 480 

ccg aaa atg teg gcg tta tgg ttg aaa gag ttc ttg agg ttt gat caa 1488 
Pro Lys Met Ser Ala Leu Trp Leu Lys Glu Phe Leu Arg Phe Asp Gin 
485 490 495 

gaa gac gat tct teg acg tct aag aaa gaa gag aag aaa gag age tat 153 6 

Glu Asp Asp Ser Ser Thr Ser Lys Lys Glu Glu Lys Lys Glu Ser Tyr 
500 505 510 

gga aaa cag tta ttg cat tct gtt cag gac agt caa ttt gtt cat teg 1584 
Gly Lys Gin Leu Leu His Ser Val Gin Asp Ser Gin Phe Val His Ser 
515 520 525 

att aaa gac agt ggt gcg tta cct gcg gtt ttg ggg age ttg ttc gtt 1632 
He Lys Asp Ser Gly Ala Leu Pro Ala Val Leu Gly Ser Leu Phe Val 
530 535 540 



/ 
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gtg tct gca act gtt ggt act tct ctg ttc ttc aaa gga get aat aat 1680 
Val Ser Ala Thr Val Gly Thr Ser Leu Phe Phe Lys Gly Ala Asn Asn 
545 550 555 560 

tga 1683 

<210> 2 

<211> 560 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 2 

Met Ala His Leu Gin Arg Thr Phe Pro Thr Glu Met Ser Lys Gly Arg 
15 10 15 



Ala Ser Phe Pro Lys Gly Phe Leu Phe Gly Thr Ala Ser Ser Ser Tyr 
20 25 30 



Gin Tyr Glu Gly Ala Val Asn Glu Gly Ala Arg Gly Gin Ser Val Trp 
35 40 45 



Asp His Phe Ser Asn Arg Phe Pro His Arg lie Ser Asp Ser Ser Asp 
50 55 60 



Gly Asn Val Ala Val Asp Phe Tyr His Arg Tyr Lys Glu Asp lie Lys 
65 70 75 80 



Arg Met Lys Asp lie Asn Met Asp Ser Phe Arg Leu Ser lie Ala Trp 
85 90 95 



Pro Arg Val Leu Pro Tyr Gly Lys Arg Asp Arg Gly Val Ser Glu Glu 
100 105 110 



Gly lie Lys Phe Tyr Asn Asp Val lie Asp Glu Leu Leu Ala Asn Glu 
115 120 125 



lie Thr Pro Leu Val Thr lie Phe His Trp Asp lie Pro Gin Asp Leu 
130 135 140 



Glu Asp Glu Tyr Gly Gly Phe Leu Ser Glu Gin He He Asp Asp Phe 
145 150 155 160 



J 
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Arg Asp Tyr Ala Ser Leu Cys Phe Glu Arg Phe Gly Asp Arg Val Ser 
165 170 175 



Leu Trp Cys Thr Met Asn Glu Pro Trp Val Tyr Ser Val Ala Gly Tyr 
180 185 190 



Asp Thr Gly Arg Lys Ala Pro Gly Arg Cys Ser Lys Tyr Val Asn Gly 
195 200 205 



Ala Ser Val Ala Gly Met Ser Gly Tyr Glu Ala Tyr lie Val Ser His 
210 215 220 



Asn Met Leu Leu Ala His Ala Glu Ala Val Glu Val Phe Arg Lys Cys 
225 230 235 240 



Asp His lie Lys Asn Gly Gin lie Gly lie Ala His Asn Pro Leu Trp 
245 250 255 



Tyr Glu Pro Tyr Asp Pro Ser Asp Pro Asp Asp Val Glu Gly Cys Asn 
260 265 270 



Arg Ala Met Asp Phe Met Leu Gly Trp His Gin His Pro Thr Ala Cys 
275 280 285 



Gly Asp Tyr Pro Glu Thr Met Lys Lys Ser Val Gly Asp Arg Leu Pro 
290 295 300 



Ser Phe Thr Pro Glu Gin Ser Lys Lys Leu lie Gly Ser Cys Asp Tyr 
305 310 315 320 



Val Gly lie Asn Tyr Tyr Ser Ser Leu Phe Val Lys Ser lie Lys His 
325 330 335 



Val Asp Pro Thr Gin Pro Thr Trp Arg Thr Asp Gin Gly Val Asp Trp 
340 345 350 



Met Lys Thr Asn lie Asp Gly Lys Gin lie Ala Lys Gin Gly Gly Ser 
355 360 365 



Glu Trp Ser Phe Thr Tyr Pro Thr Gly Leu Arg Asn lie Leu Lys Tyr 
370 375 380 



Val Lys Lys Thr Tyr Gly Asn Pro Pro lie Leu 
385 390 395 



lie Thr Glu Asn Gly 
400 
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Tyr Gly Glu Val Ala Glu Gin Ser Gin Ser Leu Tyr Met Tyr Asn Pro 
405 410 415 



Ser lie Asp Thr Glu Arg Leu Glu Tyr lie Glu Gly His lie His Ala 
420 425 430 



He His Gin Ala He His Glu Asp Gly Val Arg Val Glu Gly Tyr Tyr 
435 440 445 



Val Trp Ser Leu Leu Asp Asn Phe Glu Trp Asn Ser Gly Tyr Gly Val 
450 455 460 



Arg Tyr Gly Leu Tyr Tyr He Asp Tyr Lys Asp Gly Leu Arg Arg Tyr 
465 470 475 480 



Pro Lys Met Ser Ala Leu Trp Leu Lys Glu Phe Leu Arg Phe Asp Gin 
485 490 495 



Glu Asp Asp Ser Ser Thr Ser Lys Lys Glu Glu Lys Lys Glu Ser Tyr 
500 505 510 



Gly Lys Gin Leu Leu His Ser Val Gin Asp Ser Gin Phe Val His Ser 
515 520 525 



He Lys Asp Ser Gly Ala Leu Pro Ala Val Leu Gly Ser Leu Phe Val 
530 535 540 



Val Ser Ala Thr Val Gly Thr Ser Leu Phe Phe Lys Gly Ala Asn Asn 
545 550 555 560 



<210> 3 

<211> 5 

<212> PRT 

<213> Artificial 

<400> 3 

Thr Phe Asn Glu Pro 
1 5 



<210> 4 
<211> 5 
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<212> PRT 

<213> Artificial 

<220> 

<221> MISC_FEATURE 

<222> (1)..(1) 

<223> "X" may be lie or Val 

<400> 4 



Xaa Thr Glu Asn Gly 
1 5 



